Preface to the Expanded Edition


Besides the correction of typos, the most important change is the inclusion 
of over 300 exercises to go with these 24 chapters. Answers are given to 
all exercises, but not the solutions. They were chosen to test the material 
covered in this book. This was done in response to calls from students 
and instructors. A similar inclusion will be made in the next edition of the 
companion volume, Fundamentals of Physics II. 


Preface to the First Edition 


Given that the size of textbooks has nearly tripled during my own career, 
without a corresponding increase in the cranial dimensions of my students, 
I have always found it necessary, like my colleagues elsewhere, to cull the 
essentials into a manageable size. I did that in the course Fundamentals 
of Physics I taught at Yale, and this book preserves that feature. It covers 
the fundamental ideas of Newtonian mechanics, relativity, fluids, waves, 
oscillations, and thermodynamics without compromise. It requires only 
the basic notions of differentiation and integration, which I often review 
as part of the lectures. It is aimed at college students in physics, chemistry, 
and engineering as well as advanced high school students and independent 
self-taught learners at various stages in life, in various careers. 

The chapters in the book more or less follow my 24 lectures, 
with a few minor modifications. The style preserves the classroom 
atmosphere. Often I introduce the questions asked by the students 
or the answers they give when I believe they will be of value to 
the reader. The simple figures serve to communicate the point without
driving up the price. The equations have been typeset and are
a lot easier to read than in the videos. The problem sets and exams,
without which one cannot learn or be sure one has learned the
physics, may be found along with their solutions at the Yale website,
http://oyc.yale.edu/physics, free and open to all. The lectures may also be
found at venues such as YouTube, iTunes
(https://itunes.apple.com/us/itunes-u/physics-video/id341651848?mt=10),
and Academic Earth, to name a few.

The book, along with the material available at the Yale website, may 
be used as a stand-alone resource for a course or self-study, though some 
instructors may prescribe it as a supplement to another one adapted for 
the class, so as to provide a wider choice of problems or more worked 
examples. 

To my online viewers I say, “You have seen the movie; now read the 
book!” The advantage of having the printed version is that you can read it 
during take-off and landing. 


In the lectures I sometimes refer to my Basic Training in Mathematics, 
published by Springer and intended for anyone who wants to master the 
undergraduate mathematics needed for the physical sciences. 

This book owes its existence to many people. It all began when Peter 
Salovey, now President, then Dean of Yale College, asked me if I minded 
having cameras in my Physics 200 lectures to make them part of the first 
batch of Open Yale Courses, funded by the Hewlett Foundation. Since my 
answer was that I had yet to meet a camera I did not like, the taping be- 
gan. The key person hereafter was Diana E. E. Kleiner, Dunham Professor, 
History of Art and Classics, who encouraged and guided me in many ways. 
She was also the one who persuaded me to write this book. Initially reluc- 
tant, I soon found myself thoroughly enjoying proselytizing my favorite 
subject in this new format. At Yale University Press, Joe Calamia was my 
friend, philosopher, and guide. Liz Casey did some very skilled editing. Be- 
sides correcting errors in style (such as a long sentence that began in first 
person past tense and ended in third person future tense) and matters of 
grammar and punctuation (which I sprinkle pretty much randomly), she 
also made sure my intent was clear in every sentence. 

Barry Bradlyn and Alexey Shkarin were two graduate students and 
Qiwei Claire Xue and Dennis Mou were two undergraduates who proof- 
read earlier versions. 

My family, from my wife, Uma, down to little Stella, have encouraged 
me in various ways. 

I take this opportunity to acknowledge my debt to the students at Yale 
who, over nearly four decades, have been the reason I jump out of bed on 
two or three days a week. I am grateful for their friendship and curiosity. 
In recent years, they were often non-majors, willing to be persuaded that 
physics was a fascinating subject. This I never got tired of doing, thanks to 
the nature of the subject and the students. 

CHAPTER 1


The Structure of Mechanics 


1.1 Introduction and some useful tips 


This book is based on the first half of a year-long course that introduces 
you to all the major ideas in physics, starting from Galileo and Newton, 
right up to the big revolutions of the twentieth century: relativity and 
quantum mechanics. The target audience for this course and book is really 
very broad. In fact, I have always been surprised by the breadth of interests 
of my students. I don’t know what you are going to do later in life, so I have 
picked the topics that all of us in physics find fascinating. Some may not 
be useful, but you just don’t know. Some of you are probably going to be 
doctors, and you don’t know why I’m going to cover special relativity or 
quantum mechanics. Well, if you're a doctor and you have a patient who’s
running away from you at the speed of light, you'll know what to do. Or, if
you're a pediatrician, you will understand why your patient will not sit still:
the laws of quantum mechanics don’t allow a very small object to have a 
definite position and momentum. Whether or not you become a physicist, 
you should certainly learn about these great strides in the human attempt 
to understand the physical world. 

Most textbooks are about 1,200 pages long, but when I learned 
physics they were around 400 pages long. When I look around, I don’t 
see any student whose head is three times as big as mine, so I know that 
you cannot digest everything the books have. I take what I think are the 
really essential parts and cover them in these lectures. So you need the lec- 
tures to find out what’s in the syllabus and what’s not. If you don’t do that, 
there’s a danger you will learn something you don’t have to, and we don’t
want that, right? 

To learn physics well, you have to do the problems. If you watch me 
online doing things on the blackboard or working through derivations in 
the book, it all looks very reasonable. It looks like you can do it yourself 
and that you understand what is going on, but the only way you're going
to find out is by actually doing problems. A fair number are available,
with their solutions, at http://oyc.yale.edu/physics/phys-200, and over 300 
(with answers but not solutions) in the exercises and answers sections at 
the back of this book. You don’t have to do them by yourself. That’s not 
how physics is done. I am now writing a paper with two other people. My 
experimental colleagues write papers with four hundred or even a thou- 
sand other people when engaged in the big collider experiments like the 
ones in Geneva or Fermilab. It’s perfectly okay to be part of a collaboration, 
but you have to make sure that you're pulling your weight, that everybody 
makes contributions to finding the solution and understands it. 

This calculus-based course assumes you know the rudiments of dif- 
ferential and integral calculus, such as functions, derivatives, derivatives 
of elementary functions, elementary integrals, changing variables in inte- 
grals, and so on. Later, I will deal with functions of more than one variable, 
which I will briefly introduce to you, because that is not a prerequisite. You 
have to know your trigonometry, to know what’s a sine and what’s a co- 
sine and some simple identities. You cannot say, “I will look it up.” Your 
birthday and social security number are things you look up; trigonometric 
functions and identities are what you know all the time. 


1.2 Kinematics and dynamics 


We are going to be studying Newtonian mechanics. Standing on the shoul- 
ders of his predecessors, notably Galileo, Isaac Newton placed us on the 
road to understanding all the mechanical phenomena for centuries until 
the laws of electromagnetism were discovered, culminating in Maxwell’s 
equations. Our concern here is mechanics, which is the motion of billiard 
balls and trucks and marbles and whatnot. You will find out that the laws 
of physics for this entire semester can be written down on the back of 
an envelope. A central purpose of this course is to show you repeatedly 
that starting with those few laws, you can deduce everything. I would en- 
courage you to think the way physicists do, even if you don’t plan to be a 
physicist. The easiest way to master this subject is to follow the reasoning I 
give you. That way, you don’t have to store too many things in your head.
Early on, when there are four or five formulas, you can memorize all of 
them and you can try every one of them until something works, but, after 
a couple of weeks, you will have hundreds of formulas, and you cannot 
memorize all of them. You cannot resort to trial and error. You have to 
know the logic. 

The goal of physics is to predict the future given the present. We will 
pick some part of the universe that we want to study and call it “the sys- 
tem,” and we will ask, “What information do we need to know about that 
system at the initial time, like right now, in order to be able to predict its 
future evolution?” If I throw a piece of candy at you and you catch it, that’s 
an example of Newtonian mechanics at work. What did I do? I threw a 
piece of candy from my hand, and the initial conditions are where I re- 
leased it and with what velocity. That’s what you see with your eyes. You 
know it’s going to go up, it’s going to follow some kind of parabola, and 
your hands get to the right place at the right time to receive it. That is an 
example of Newtonian mechanics at work, and your brain performed the 
necessary calculations effortlessly. 

You only have to know the candy’s initial location and the initial ve- 
locity. The fact that it was blue or red is not relevant. If I threw a gorilla 
at you, its color and mood would not matter. These are things that do not 
affect the physics. If a guy jumps off a tall building, we want to know when,
and with what speed, he will land. We don’t ask why this guy is ending it 
all today; that is a question for the psych department. So we don’t answer 
everything. We ask very limited questions about inanimate objects, and we 
brag about how accurately we can predict the future. 

The Newtonian procedure for predicting the future, given the 
present, has two parts, kinematics and dynamics. Kinematics is a complete 
description of the present. It’s a list of what you have to know about a sys- 
tem right now. For example, if you're talking about a piece of chalk, you 
will want to know where it is and how fast it’s moving. Dynamics then 
tells you why the chalk goes up, why it goes down, and so on. It comes 
down due to the force of gravity. In kinematics, you don’t ask for the rea- 
son behind anything. You simply want to describe things the way they are, 
and then dynamics tells you how and why that description changes with 
time. 

I'm going to illustrate the idea of kinematics by following my preferred
approach: starting with the simplest possible example and slowly
adding bells and whistles to make it more and more complicated. In the
initial stages, some of you might say, “Well, I have seen this before, so
maybe there is nothing new here.” That may well be true. I don’t know how 
much you have seen, but it is likely that the way you learned physics in high 
school is different from the way professional physicists think about it. Our 
priorities, and the things that we get excited about, are often different; and 
the problems will be more difficult. 


1.3 Average and instantaneous quantities 


We are going to study an object that is a mathematical point. It has no 
size. If you rotate it, it will look the same, unlike a potato, which will look 
different upon rotation. It is not enough to just say where the potato is; 
you have to say which way its nose is pointing. The study of such extended 
bodies comes later. Right now, we want to study an entity that has no spatial
extent, a dot. It can move around all over space. We're going to simplify
that too. We’re going to take an entity that moves only along the x-axis. So
you can imagine a bead with a straight wire going through it, which allows 
it to only slide back and forth. This is about the simplest thing. I cannot 
reduce the number of dimensions. I cannot make the object simpler than 
a mathematical point. 

To describe what the point is doing, we pick an origin, call it x = 0, 
and put some markers along the x-axis to measure distance. Then we will 
say this guy is sitting at x = 5. Now, of course, we have to have units and the 
unit for length is going to be the meter. The unit for time will be a second. 
Sometimes I might not write the units, but I have earned the right to do 
that and you haven't. Everything has got to be in the right units. If you 
don’t have the units, and if you say the answer is 42, then we don't know if 
you are right or wrong. 

Back to the object. At a given instant, it’s got a location. We would like 
to describe the object’s motion by plotting a graph of space versus time. A 
typical graph would be something like Figure 1.1. Even though the plot is 
going up and down, the object is moving horizontally, back and forth along 
the spatial x-axis. When it is at A, it’s crossing the origin from the left and 
going to the right. Later, at B, it is crossing back to the left. In the language 
of calculus, x is a function of time, x = x(t), and the graph corresponds 
to some generic function that doesn’t have a name. We will also encounter 
functions that do have a name, like $x(t) = t$, $x(t) = t^2$, $x(t) = \sin t$, $\cos t$,
and so on. 

Figure 1.1 Trajectory of a particle. The position, x(t), is measured vertically and
the time, t, is measured horizontally. 


Consider $\bar{v}$, the average velocity of an object, given by

$$\bar{v} = \frac{x(t_2) - x(t_1)}{t_2 - t_1} \tag{1.1}$$

where $t_2 > t_1$ are two times between which we have chosen to average the
velocity. In the example in Figure 1.1, $\bar{v} < 0$ for the indicated choice of $t_1$
and $t_2$ since the final $x(t_2)$ is less than the initial $x(t_1)$.

The average velocity may not tell you the whole story. For example, 
if you started at $x(t_1)$ and at some later time ended up at point C with the same
coordinate, the average velocity would be zero, which is the average you 
would get if the particle had never moved! 

The average acceleration, $\bar{a}$, involves a similar difference of velocities:

$$\bar{a} = \frac{v(t_2) - v(t_1)}{t_2 - t_1} \tag{1.2}$$
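
As a quick numerical illustration of Eqns. 1.1 and 1.2, here is a minimal Python sketch. The trajectory x(t) = 10t - 5t^2 and the helper names are assumptions chosen for illustration; they are not read off Figure 1.1. The sketch shows how an average velocity of zero can hide real motion.

```python
def x(t):
    """Hypothetical trajectory (meters): x(t) = 10 t - 5 t^2."""
    return 10.0 * t - 5.0 * t**2


def v(t):
    """Velocity of that trajectory (m/s), its exact derivative: 10 - 10 t."""
    return 10.0 - 10.0 * t


def average(f, t1, t2):
    """Average rate of change of f between t1 and t2, as in Eqns. 1.1 and 1.2."""
    return (f(t2) - f(t1)) / (t2 - t1)


v_bar = average(x, 0.0, 2.0)  # average velocity over [0, 2] s
a_bar = average(v, 0.0, 2.0)  # average acceleration over [0, 2] s
print(v_bar, a_bar)
```

In this example the particle travels out to x = 5 m at t = 1 s and comes back to where it started, so the average velocity over the full interval vanishes even though the particle was moving the whole time.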

Now for an important concept, the velocity at a given time or instantaneous
velocity, $v(t)$. Figure 1.1 shows some particle moving a distance
$\Delta x$ between times $t$ and $t + \Delta t$. The average velocity in that interval is
$\Delta x / \Delta t$. What you want is the velocity at time $t$. We all have an intuitive
notion of velocity right now. When you're driving your car, if the needle
says 60 miles per hour, that’s your velocity at that instant. Though velocity
seems to involve two different times in its very definition (the initial
time and the final time), we want to talk about the velocity right now. That
is obtained by examining the position now and the position slightly later,
and taking the ratio of the change in position to the time elapsed between
the two events, while bringing the two points closer and closer in time. We
see in the figure that when we do this, both $\Delta x \to 0$ and $\Delta t \to 0$, but their
ratio becomes the tangent of the angle $\theta$, shown in Figure 1.1. Thus the
velocity at the instant $t$ is:

$$v(t) = \lim_{\Delta t \to 0} \frac{\Delta x}{\Delta t} = \frac{dx}{dt}. \tag{1.3}$$

Once you take one derivative, you can take any number of deriva- 
tives. The derivative of the velocity is the acceleration, and we write it as 
the second derivative of position: 

$$a(t) = \frac{dv}{dt} = \frac{d^2x}{dt^2}. \tag{1.4}$$
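
To see the limit in Eqn. 1.3 at work, here is a small Python sketch. The trajectory x(t) = t^3 and the function names are assumptions picked for illustration. The sketch computes the ratio of the change in x to the change in t for smaller and smaller intervals and compares it with the derivative 3t^2.

```python
def x(t):
    """Hypothetical trajectory chosen for illustration: x(t) = t^3."""
    return t**3


def difference_quotient(f, t, dt):
    """The ratio Delta x / Delta t over the interval [t, t + dt]."""
    return (f(t + dt) - f(t)) / dt


t = 2.0
exact = 3.0 * t**2  # derivative of t^3 from the power rule
for dt in (1.0, 0.1, 0.01, 0.001):
    approx = difference_quotient(x, t, dt)
    print(f"dt = {dt:<6} ratio = {approx:.6f}  error = {approx - exact:.6f}")

estimate = difference_quotient(x, t, 1e-6)
```

The error shrinks along with the interval, which is the numerical face of the statement that the ratio tends to the slope of the tangent.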


You are supposed to know the derivatives of simple functions like $x(t) = t^n$
($\frac{dx}{dt} = n t^{n-1}$), as well as derivatives of sines, cosines, logarithms, and
exponentials. If you don’t know them, you should fix that weakness before
proceeding.


1.4 Motion at constant acceleration 


We are now going to focus on problems in which the acceleration a(t) is 
just a constant denoted by a, with no time argument. This is not the most 
general motion, but a very relevant one. When things fall near the surface 
of the earth, they all have the same acceleration, $a = -9.8\ \mathrm{m\,s^{-2}} = -g$.
If I tell you that a particle has a constant acceleration a, can you tell me 
what the position x(t) is? Your job is to guess a function x(t) whose second 
derivative is a. This is called integration, which is the opposite of differenti- 
ation. Integration is not an algorithmic process like differentiation, though 
it is governed by many rules that allow us to map a given problem into 
others with a known solution. If I give you a function, you know how to 
take the derivative: change the independent variable, find the change in 
the function, divide by the change in the independent variable, take the 
ratio as all changes approach zero. The opposite has to be done here. The 
way we do that is we guess, and such guessing has been going on for three 
hundred years, and we have become very good at it. The successful guesses 
are published as the Table of Integrals. I have a copy of such a table at
home, at work, and even in my car in case there is a breakdown. 

So, let me guess aloud. I want to find a function that reduces to the 
number a when I take two derivatives. I know that each time I take a 
derivative, I lose a power of t. In the end, I don’t want any powers of t. 
It’s clear I have to start with a function that looks like $t^2$. Well, unfortunately,
we know $t^2$ is not the right answer, because the second derivative is
2, while I want to get $a$. So I multiply the original guess by $\frac{1}{2}a$ and I know
$x(t) = \frac{1}{2}at^2$ will have a second derivative $a$.

This certainly describes a particle with an acceleration a. But is this 
the most general answer? You all know that it is just one of many: for ex- 
ample, I can add to this answer some number, say 96, and the answer will 
still have the property that if you take two derivatives, you get the same ac- 
celeration. Now 96 is a typical constant, so I’m going to give the name c to 
that constant. We know from basic calculus that in finding a function with 
a given derivative, you can always add a constant to any one answer to get 
another answer. But if you only fix the second derivative, you can also add 
anything with one power of t in it, because the extra part will get wiped 
out when you take two derivatives. If you fixed only the third derivative of 
the function, you can also add something quadratic in t without changing 
the outcome. 

So the most general expression for the position of a particle with 
constant acceleration a is

$$x(t) = \frac{1}{2}at^2 + bt + c \tag{1.5}$$


where b, like c, is a constant that can be anything. 
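
The claim that b and c drop out after two derivatives can be checked numerically. The following Python sketch is only an illustration: the values of a, b, and c and the helper names are assumptions, and the second derivative is estimated with a standard central difference rather than computed symbolically.

```python
def make_x(a, b, c):
    """Return x(t) = (1/2) a t^2 + b t + c for given constants."""
    return lambda t: 0.5 * a * t**2 + b * t + c


def second_derivative(f, t, h=1e-2):
    """Central-difference estimate of the second derivative of f at t."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / h**2


a = 3.0  # hypothetical acceleration
results = []
for b, c in [(0.0, 0.0), (5.0, 96.0), (-2.0, 7.5)]:
    x = make_x(a, b, c)
    results.append(second_derivative(x, t=1.7))
print(results)
```

Every choice of b and c returns the same acceleration, up to rounding, so the acceleration alone cannot tell these trajectories apart.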

Remember that x(t) in the figure describes a particle going from side 
to side. I can also describe a particle going up and down. If I do that, I 
would like to call the vertical coordinate y(t). You have to realize that in 
calculus, the symbols that you call x and y are arbitrary. If you know the 
second derivative of y to be a, then the answer is

$$y(t) = \frac{1}{2}at^2 + bt + c. \tag{1.6}$$


Let me go back now to Eqn. 1.5. It is true, mathematically, you can add 
bt + c as we did, but you have to ask yourself, “What am I doing as a 
physicist when I add these two terms?” What am I supposed to do with 
b and c? What value should I pick? Simply knowing that the particle has 
an acceleration a is not enough to tell you where the particle will be. Take
the case of a particle falling under gravity with acceleration $-g$. Then

$$y(t) = -\frac{1}{2}gt^2 + bt + c. \tag{1.7}$$


The formula describes every object falling under gravity, and each has its 
own history. What’s different between one object and another object is the 
initial height, $y(0) = y_0$, and the initial velocity $v(0) = v_0$. That’s what these
numbers b and c are going to tell us. To find c in Eqn. 1.7 put time t = 0
on the right and the initial height of $y_0$ on the left:

$$y_0 = 0 + 0 + c \tag{1.8}$$


which tells us c is just the initial coordinate. Feeding this into Eqn. 1.7 we 
obtain

$$y(t) = -\frac{1}{2}gt^2 + bt + y_0. \tag{1.9}$$


To use the information on the initial velocity, let us first find the velocity 
associated with this trajectory:

$$v(t) = \frac{dy}{dt} = -gt + b \tag{1.10}$$

and compare both sides at t = 0:

$$v_0 = b. \tag{1.11}$$


Thus b is the initial velocity. Trading b and c for $v_0$ and $y_0$, which makes
their physical significance more transparent, we now write

$$y(t) = -\frac{1}{2}gt^2 + v_0 t + y_0. \tag{1.12}$$


Likewise for the trajectory x(t) when the acceleration is some constant a, 
the answer with specific initial position $x_0$ and initial velocity $v_0$ is

$$x(t) = \frac{1}{2}at^2 + v_0 t + x_0. \tag{1.13}$$

In every situation where the body has a constant acceleration a, the location
has to have this form. So when I throw a candy and you catch it, you are 
mentally estimating the initial position and velocity and computing the 
trajectory and intercepting it with your hands. (The candy moves in three 
spatial dimensions, but the idea is the same.) 
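
Eqn. 1.13 and its time derivative translate directly into code. The sketch below is a minimal Python version; the function names and the sample numbers are assumptions made for illustration. It shows two objects with the same acceleration but different initial conditions following different histories.

```python
def position(t, x0, v0, a):
    """Eqn. 1.13: position at time t for constant acceleration a."""
    return 0.5 * a * t**2 + v0 * t + x0


def velocity(t, v0, a):
    """Time derivative of Eqn. 1.13: velocity at time t."""
    return a * t + v0


# Two objects with the same acceleration but different initial conditions
# (all numbers are hypothetical).
a = -10.0
for x0, v0 in [(0.0, 0.0), (15.0, 10.0)]:
    print(x0, v0, position(2.0, x0, v0, a), velocity(2.0, v0, a))
```

Setting t = 0 returns the initial position and velocity, which is exactly how the constants b and c were identified above.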

Now, there is one other celebrated formula that relates v(t), the final 
velocity at some time, to the initial velocity $v_0$ and the distance traveled,
with no reference to time. The trick is to eliminate time from Eqn. 1.13.
Let us rewrite it as

$$x(t) - x_0 = \frac{1}{2}at^2 + v_0 t. \tag{1.14}$$

Upon taking the time-derivative of both sides we get

$$v(t) = at + v_0 \tag{1.15}$$

which may be solved for t:

$$t = \frac{v(t) - v_0}{a}. \tag{1.16}$$

Feeding this into Eqn. 1.14 we find

$$x(t) - x_0 = \frac{1}{2}a\left[\frac{v(t) - v_0}{a}\right]^2 + v_0\left[\frac{v(t) - v_0}{a}\right] \tag{1.17}$$

$$= \frac{v^2(t) - v_0^2}{2a} \tag{1.18}$$

which is usually written as

$$v^2 - v_0^2 = 2a(x - x_0) \tag{1.19}$$


where v and x are assumed to be the values at some common generic 
time t. 
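
A numerical spot check of Eqn. 1.19 may be reassuring. In this Python sketch the initial position, initial velocity, and acceleration are arbitrary assumed values; x and v are computed from Eqns. 1.13 and 1.15 at a series of common times, and the two sides of Eqn. 1.19 are compared.

```python
x0, v0, a = 2.0, 3.0, -4.0  # hypothetical initial position, velocity, acceleration

max_mismatch = 0.0
for step in range(11):
    t = 0.5 * step
    x = 0.5 * a * t**2 + v0 * t + x0  # Eqn. 1.13
    v = a * t + v0                    # Eqn. 1.15
    lhs = v**2 - v0**2
    rhs = 2.0 * a * (x - x0)
    max_mismatch = max(max_mismatch, abs(lhs - rhs))

print(max_mismatch)
```

Time appears only as a device for generating matching pairs of x and v; the relation being tested makes no reference to it.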

1.5 Sample problem


We will work through one standard problem to convince ourselves that we 
know how to apply these formulas and predict the future given the present. 
Figure 1.2 shows a building of height $y_0 = 15$ m.

I am going to throw a rock with an initial velocity $v_0 = 10$ m/s from
the top. Notice I am measuring y from the ground. The rock is going to
go up to point T and come down as shown in Figure 1.2. You can ask me
any question you want about this rock, and I can give the answer. You can
ask me where it will be 9 seconds from now, how fast will it be moving 8
seconds from now, and so on. All I need are the two initial conditions $y_0$
and $v_0$ that are given. To make life simple, I will use $a = -g = -10\ \mathrm{m\,s^{-2}}$.
The position y(t) is known for all future times:

$$y = 15 + 10t - 5t^2. \tag{1.20}$$


Of course, you must be a little careful when you use this result. Say you 
put t equal to 10,000 years. What are you going to get? You're going to
find y is some huge negative number. That reasoning is flawed because you
cannot use the formula once the rock hits the ground and the fundamental
premise that $a = -10\ \mathrm{m\,s^{-2}}$ becomes invalid. Now, if you had dug a hole of
depth d where the rock was going to land, y could go down to $-d$. The
moral is that when applying a formula, you must bear in mind the terms
under which it was derived.

Figure 1.2 From the top of a building of height $y_0 = 15$ m, I throw a rock with
an initial upward velocity of $v_0 = 10$ m/s. The dotted line represents the trajectory
continued back to earlier times.

If you want to know the velocity at any time t, just take the derivative
of Eqn. 1.20:

$$v(t) = 10 - 10t. \tag{1.21}$$


Let me pick a few more trivial questions. What is the height $y_{\max}$ of the
turning point T in the figure? Eqn. 1.20 tells you y if you know t, but we
don’t know the time $t^*$ when it turns around. So you have to put in something
else that you know, which is that the highest point occurs when it’s
neither going up nor coming down. So at the highest point $v(t^*) = 0$. From
Eqn. 1.21

$$0 = 10 - 10t^* \quad \text{which means} \quad t^* = 1\ \mathrm{s}. \tag{1.22}$$


So we know that it will go up for one second and then turn around 
and come back. Now we can find $y_{\max}$:

$$y_{\max} = y(t^*) = y(1) = 15 + 10 - 5 = 20\ \mathrm{m}. \tag{1.23}$$
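
The turning-point calculation can be mirrored in a few lines of Python. This is a sketch using the numbers of the sample problem; the function and variable names are my own choices for illustration.

```python
y0, v0, g = 15.0, 10.0, 10.0  # values from the sample problem (m, m/s, m/s^2)


def y(t):
    """Height above the ground, Eqn. 1.20."""
    return y0 + v0 * t - 0.5 * g * t**2


def v(t):
    """Velocity, Eqn. 1.21."""
    return v0 - g * t


t_star = v0 / g    # solves v(t*) = 0
y_max = y(t_star)  # height of the turning point T
print(t_star, y_max)

# Heights a little before and a little after t* are lower than y_max.
print(y(t_star - 0.1), y(t_star + 0.1))
```

The last line is a sanity check that the point where the velocity vanishes really is the top of the trajectory.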


When does it hit the ground? That is the same as asking when y = 0, 
which is our origin. When y = 0, 


$$0 = 15 + 10t - 5t^2. \tag{1.24}$$

The solutions to this quadratic equation are

$$t = 3\ \mathrm{s} \quad \text{or} \quad t = -1\ \mathrm{s}. \tag{1.25}$$


Why is it giving me a second solution? Can t be negative? First of all, 
negative times should not bother anybody; t = 0 is when I set the clock 
to zero, and I measured time forward, but yesterday would be $t = -1$ day,
right? So we don’t have any trouble with negative time; it is like the year 
300 BC. The point is that this equation does not know that I went to a 
building and launched a rock or anything. What does it know? It knows 
that this particle had a height of y = 15 m and velocity v = 10 m/s at time 
t = 0, and it is falling under gravity with an acceleration of $-10\ \mathrm{m\,s^{-2}}$.
That’s all it knows. If that’s all it knows, then in that scenario there is no
building or anything else; it continues a trajectory both forward in time
and backward in time, and it says that one second before I set my clock to
0, this particle would have been on the ground. What it means is that if you
had released a rock at y = 0 one second before I did with a certain speed
that we can calculate ($v(-1) = 20$ m/s from Eqn. 1.21), your rock would
have ended up at the top of the building when I began my experiment,
with the same height y = 15 m, and velocity $v_0 = 10$ m/s. So sometimes
the extra solution is very interesting, and you should always listen to the 
mathematics when you get extra solutions. 
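
Both roots, and what each one means, can be checked with a short Python sketch. It uses the numbers of the sample problem and the ordinary quadratic formula; the names are assumptions made for illustration.

```python
import math

y0, v0, g = 15.0, 10.0, 10.0  # values from the sample problem


def y(t):
    return y0 + v0 * t - 0.5 * g * t**2  # Eqn. 1.20


def v(t):
    return v0 - g * t  # Eqn. 1.21


# Solve 0 = y0 + v0 t - (g/2) t^2 with the quadratic formula.
A, B, C = -0.5 * g, v0, y0
disc = math.sqrt(B**2 - 4.0 * A * C)
roots = sorted([(-B + disc) / (2.0 * A), (-B - disc) / (2.0 * A)])
print(roots)                     # the two times at which y = 0
print(v(roots[0]), v(roots[1]))  # velocities at those times
```

The earlier root comes with an upward velocity and the later root with a downward one, which matches the picture of a rock launched from the ground before the clock was started.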

When Paul Dirac was looking for the energy of a particle in rela- 
tivistic quantum mechanics, he found the energy E was connected to its 
momentum p, mass m, and velocity of light, c, by 


$$E^2 = p^2c^2 + m^2c^4, \tag{1.26}$$


in accord with a relation we will encounter in relativity. Now, this quadratic 
equation has two solutions: 


$$E = \pm\sqrt{p^2c^2 + m^2c^4}. \tag{1.27}$$


You may be tempted to keep the plus sign because you know energy 
is not going to be negative. The particle's moving, it's got some energy
and that’s it. This is correct in classical mechanics, but in quantum mechanics
the mathematicians told Dirac, “You cannot ignore the negative
energy solution in quantum theory; the mathematics tells you it is there.”
It turns out the second solution, with negative energy, was telling us that 
if there are particles, then there must be anti-particles, and the nega- 
tive energy particles, when properly interpreted, describe anti-particles of 
positive energy. 

So the equations are very smart. When you find some laws in mathe- 
matical form, you have to follow the mathematical consequences; you have 
no choice. Here was Dirac, who was not looking for anti-particles. He was 
trying to describe electrons, but the theory said there are two roots to the 
quadratic equation and the second root is mathematically as significant as 
the first one. In trying to accommodate and interpret it, Dirac was led to 
the positron, the electron’s anti-particle. 

Returning to our problem, if you were only asking for the maximum
height $y_{\max}$, and not the time $t^*$ when it got there, there is a shortcut using

$$v^2 = v_0^2 + 2a(y - y_0). \tag{1.28}$$

Using $v = 0$, $v_0 = 10$ m/s, and $a = -10\ \mathrm{m\,s^{-2}}$ we find

$$y_{\max} - y_0 = 5\ \mathrm{m} \tag{1.29}$$


that is, the rock reached a maximum height of 20 m from the ground.
You can find the speed when it hits the ground (y = 0) using

$$v^2 = 10^2 + 2 \cdot (-10)(0 - 15) = 400 \quad \text{which means} \quad v = \pm 20\ \mathrm{m/s}. \tag{1.30}$$

The root we should take for when it hits the ground is of course $-20$ m/s.
As mentioned earlier, the other root $+20$ m/s is the speed with which it
should have been launched upward, from y = 0 at $t = -1$, to follow the
dotted trajectory in the figure.
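
Here is the same shortcut as a minimal Python sketch, using the numbers of the sample problem. The function name is an assumption for illustration. Note that the formula only delivers the square of the velocity; picking the sign is a physical decision the code has to make explicitly.

```python
import math

y0, v0, a = 15.0, 10.0, -10.0  # values from the sample problem


def speed_squared(y):
    """Eqn. 1.28 solved for v^2 at height y."""
    return v0**2 + 2.0 * a * (y - y0)


# Maximum height: set v = 0 in Eqn. 1.28 and solve for y.
y_max = y0 - v0**2 / (2.0 * a)

# Impact: y = 0. The square root gives the magnitude; the sign is a
# physical choice (negative for a rock moving downward).
v_impact = -math.sqrt(speed_squared(0.0))
print(y_max, v_impact)
```

No time variable appears anywhere, which is the whole point of this route.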


1.6 Deriving $v^2 - v_0^2 = 2a(x - x_0)$ using calculus


I want to derive Eqn. 1.19, $v^2 - v_0^2 = 2a(x - x_0)$, in another way that
illustrates the judicious use of calculus.

Start with

$$\frac{dv}{dt} = a \tag{1.31}$$

and multiply both sides by v and write $v = \frac{dx}{dt}$ on the right-hand side:

$$v\frac{dv}{dt} = a\frac{dx}{dt}. \tag{1.32}$$


Now I’m going to do something that is viewed with suspicion, which is just 
to cancel the dt on both sides. Although I agree that you're not supposed 
to cancel that d in a derivative like $\frac{dv}{dt}$, canceling the dt on both sides gives valid results if
interpreted carefully. Doing so here gives us 


$$v\,dv = a\,dx. \tag{1.33}$$


This equation tells us that in an infinitesimal time interval $[t, t + dt]$, the
variables v and x change by dv and dx, and these changes are related as
above in the limit $dx, dv, dt \to 0$. Now the limit of $dx \to 0$ or $dv \to 0$ (as
compared to their ratio) is of course trivial, and Eqn. 1.33 reduces to 0 = 0.
However, the way we interpret and use Eqn. 1.33 is as follows. Suppose in
the finite time interval $[t_1, t_2]$, the variable v changes from $v_1$ to $v_2$, and
x changes from $x_1$ to $x_2$. Let us divide the interval $[t_1, t_2]$ into a very large
number N of equal sub-intervals of width dt, and let dx and dv be the
changes in x and v in the interval $[t, t + dt]$. The relation between these
changes is given in Eqn. 1.33. If we sum up the N changes on both sides of
Eqn. 1.33 as $N \to \infty$, the sums converge to nontrivial limits, namely the
corresponding integrals:

$$\int_{v_1}^{v_2} v\,dv = a\int_{x_1}^{x_2} dx \tag{1.34}$$

$$\frac{1}{2}v_2^2 - \frac{1}{2}v_1^2 = a(x_2 - x_1). \tag{1.35}$$

Thus it must be understood that the two sides of a relation like Eqn. 
1.33 are to be ultimately integrated between some limits to obtain a useful 
equality. 
Eqn. 1.19 follows upon setting

$$v_2 = v, \quad v_1 = v_0, \quad x_2 = x, \quad x_1 = x_0. \tag{1.36}$$
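
The summing procedure described above can be imitated on a computer. The Python sketch below is an illustration under assumed values of a, the initial velocity, the length of the time interval, and N; none of these numbers come from the text. It chops the interval into N pieces, adds up v dv and a dx piece by piece, and compares the totals with the two sides of Eqn. 1.35.

```python
a, v1, T, N = 2.0, 1.0, 3.0, 100_000  # hypothetical values
dt = T / N


def v(t):
    return v1 + a * t  # velocity for constant acceleration


def x(t):
    return v1 * t + 0.5 * a * t**2  # position, starting from x = 0


lhs = 0.0  # running sum of v dv
rhs = 0.0  # running sum of a dx
for i in range(N):
    t = i * dt
    dv = v(t + dt) - v(t)
    dx = x(t + dt) - x(t)
    lhs += v(t) * dv
    rhs += a * dx

v2, x1, x2 = v(T), x(0.0), x(T)
print(lhs, 0.5 * v2**2 - 0.5 * v1**2)  # compare with left side of Eqn. 1.35
print(rhs, a * (x2 - x1))              # compare with right side of Eqn. 1.35
```

For finite N the sum of v dv falls slightly short of the integral because v is sampled at the start of each sub-interval; the gap shrinks as N grows, which is what is meant by the sums converging to the integrals.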


CHAPTER 2 


Motion in Higher Dimensions 


2.1 Review 


In the last chapter we took the simplest case, of a point particle moving 
along the x-axis with a constant acceleration a. What is the fate of this 
particle? The answer is that at any time t, the location of the particle is 
given by

$$x(t) = x_0 + v_0 t + \frac{1}{2}at^2 \tag{2.1}$$


where $x_0$ and $v_0$ are its initial position and velocity. If you took the
derivative of this, you would get 


v(t) = vo + at. (2.2) 


You can easily check, by taking one more derivative, that this particle does 
indeed have a constant acceleration a. This equation, which gives the velocity
of the object at time t in terms of its initial velocity and acceleration,
can be inverted to give t in terms of v:

$$t = \frac{v - v_0}{a}. \tag{2.3}$$

Feeding this into Eqn. 2.1 we obtain the result that makes no reference to
time: 


$$v^2 = v_0^2 + 2a(x - x_0). \tag{2.4}$$


It is understood v and x correspond to some common time. 

I showed you in the end how we can use calculus to derive this result.
It is important to brush up on your calculus. When a student says,
“I know calculus,” sometimes that means the student knows it, and sometimes
that means he or she once met someone who did. One solution for
that is to get a copy of a textbook I wrote called Basic Training in Mathematics.
This is a little awkward; I don't want to foist my book on you. On
the other hand, I don't want to withhold relevant information. If you're
going into any science that uses mathematics—chemistry, engineering, or 
even economics—you should find the contents of that book useful. Don’t 
wait for the movie: it is not coming. 


2.2 Vectors in d=2 


The next difficult thing is to consider motion in higher dimensions. 
Everything moves around in d = 3. However, I’m going to use only two 
dimensions for most of the time. Whereas the difference between one di- 
mension and two is very great, that between two and higher dimensions 
is not. Later we will encounter a few concepts that make sense in d = 3 
but not d < 3. String theorists will tell you that actually we need 9 spatial 
dimensions plus time to describe superstrings, which will be discussed in 
depth in Chapter 3,498 of this book. 

Picture some particle that’s traveling in the x — y plane as shown in 
Figure 2.1. This is not an x versus t plot or a y versus t plot. It’s the actual 
path the particle traces out on the x — y plane. You might say “Where is 
time?” One way to mark time is to imagine the particle carries a clock 
with it, and put markers every second. Four representative markers at t = 
1,2,33, and 34 are shown. It obviously is going much slower between 33 
and 34 than between 1 and 2. 

Figure 2.1 Path of a particle in d = 2. Equal intervals in time are indicated by 
markers on the path numbered 1, 2, ..., 33, and 34. 

The kinematics of this particle requires a pair of numbers x and y. 
It’s more convenient to lump these into a single entity, called a vector. The 
simplest context in which one can motivate a vector and the rules for dealing 
with vectors is to look at movements in the plane. Let’s imagine that 
when I went camping I walked for 5 km from the base camp on the first 
day and another 5 km on the second day. How far am I from the base camp? 
You cannot answer that, even if I promised to move only along the x-axis. 
It’s not enough to say I went 5 km. I have to tell you whether I went to the 
right or to the left. So I could be 10 km, 0 km, or −10 km from base. If I 
not only say that I walked 5 km, but also specify whether it was +5 km or −5 km, 
that takes care of all ambiguity in one dimension. 

But in d = 2 the options are not just left and right, but an infinity 
of possible directions. For example, on the first day I could leave the base 
camp at the origin and move along the arrow labeled A to arrive at the 
point labeled 1 in Figure 2.2(a). The second hike is described by the arrow 
B, which starts where A ended and brings me to 2. These two arrows are 
examples of vectors and I use them here for describing displacement, or 
changes in position. Vectors can be used to describe many other physical 
quantities, as we will see. 

Figure 2.2 Adding vectors. Part (a) shows how to add vectors and that 
A + B = B + A. Part (b) illustrates the meaning of multiplying a vector by a 
number (2 in this example) and the null vector 0. 

A vector is an arrow that has got a beginning and an end. This is why 
one says a vector has a magnitude and a direction. The magnitude is how 
long it is, and direction is its angle relative to some fixed direction, usually 
the x-axis. When you refer to a vector A in your notes, you’re supposed to 
put a little arrow on top like this: $\vec{A}$. In textbooks, vectors are in boldface: 
$\mathbf{A}$. If you don’t put an arrow on top or do not use boldface, you’re talking 
about just a number A. When applied to a vector $\mathbf{A}$, A stands for its 
length. 

From Figure 2.2(a), we see that there is a very natural quantity that 
you can call A+ B. One day I moved by A and on the next by B. If I want 
to do it all in one shot, what is the equivalent step I should take from the 
start? It’s obvious that the bottom line of my two-day trip is this object C. 
We will call that A + B. It does represent the sum, in the same sense that if 
I gave you 4 bucks and then I gave you 5 bucks, you have the equivalent of 
a single payment of 9 bucks. Here, we are not talking about a single num- 
ber, but a displacement in the plane, and C indeed represents an effective 
displacement due to A and B. 

So here is the rule for adding two vectors that comes from a study of 
displacements: you draw the first one and at the end of that first one, you 
begin the second one, and their sum starts at the beginning of the first and 
ends at the end of the second. 

You can verify, as illustrated in the figure, that A + B is the same as 
B + A where you first draw B and from where B ends you draw A. You will 
end up with the same point, 2, as shown by the sum of the dotted arrows. 

The next thing I want to do is to define the vector that plays the role 
of the number 0, which has the property that when you add it to any num- 
ber, it gives the same number. The vector 0 that I want to call the zero 
or null vector should have the property that when I add it to any vector, 
I should get the same vector. So you can guess who it is: a vector of no 
length. I cannot show you the 0. If you can see it, I’m doing something 
wrong. 

Look at part (b) of Figure 2.2. What if I draw A, then I add to it 
another A to get A+ A. You have to agree that if there’s any vector that 
deserves to be called 2A, it is this guy, A, stretched to twice its length. Now 
we have discovered a notion of multiplying a vector by a number. If you 
multiply it by 2, you get a vector twice as long and in the same direction. 
Then we’re able to generalize that and say, if you multiply it by 2.6, you get 
a vector 2.6 times as long. So multiplying a vector by a positive number 
means to stretch it (or shrink it) by that factor. 

Let us keep going. I want to think of a vector that I can call —A. What 
do I expect of —A? I expect that if I add —A to A, I should get 0, which 
plays the role of 0 among vectors. What should I add to A so I get the null 
vector? It’s clear that you want to add a vector that looks like —A in part (b) 
of Figure 2.2, because, if you go from the start of A to the finish of —A, you 
end up where you started and you get this invisible 0 vector. So the minus 
vector is the same vector flipped over, pointing the opposite way. That’s 
like —1 times a vector. Once you have got that, you can do —7.3 times a 
vector: just take the vector, rescale it by 7.3 and flip it over. Multiplying a 
vector by a number is called scalar multiplication, and ordinary numbers 
are called scalars. You can do more complicated things. You can take one 
vector, multiply it by one scalar, take another vector, multiply that by an- 
other scalar, and add the two of them. We know what all those operations 
mean now. You don’t have to memorize the rules for all this. The only rule 
is: “Do what comes naturally.” Do what you normally do with ordinary 
numbers. 


2.3 Unit vectors 


Let us go back to the same x — y plane. I’m going to introduce two very 
special vectors. They are the unit vectors: i and j, pointing along the x and 
y axes and of unit length, as shown in Figure 2.3. If I had a third axis per- 
pendicular to the page, I would draw a k, but we don’t need that yet. I 
claim I can write any vector you give me as a number $A_x$ times i, plus a 
number $A_y$ times j. There’s nothing you can throw at me that lies in the 
plane that I cannot describe as some multiple of i plus some multiple of 
j. It’s intuitively clear, but I will just prove it beyond any doubt. Here is 
some vector A. It is clear from the figure that it is the sum of the dot- 
ted horizontal vector and the dotted vertical vector, by the rules of vector 
addition. The horizontal part, parallel to i, has to be a multiple of i. We 
know that because we can stretch i by whatever factor we like. Call that 
factor Ax, which happens to be positive in this example. It is called the 
x-component of A or the projection of A along i or along the x-axis. The vertical 
part is likewise $\mathbf{j}A_y$ where $A_y$ is the y-component of A, or the projection 
of A along j or along the y-axis. Therefore I have managed to write 
A as 


$$\mathbf{A} = \mathbf{i}A_x + \mathbf{j}A_y. \tag{2.5}$$

Figure 2.3 The unit vectors i and j and an arbitrary vector $\mathbf{A} = \mathbf{i}A_x + \mathbf{j}A_y$ built 
out of them. 


We refer to the pair i and j, in terms of which any vector can be 
expressed, as basis vectors or as the basis. 

If you gave me a particular vector A as an arrow of some length A and 
orientation $\theta$ relative to the x-axis, what do I use for $A_x$ and $A_y$? You can 
see from trigonometry that 


$$A_x = A\cos\theta \tag{2.6}$$
$$A_y = A\sin\theta. \tag{2.7}$$


Conversely, given the components, the length and angle are 


$$A = \sqrt{A_x^2 + A_y^2} \tag{2.8}$$
$$\theta = \tan^{-1}\frac{A_y}{A_x}. \tag{2.9}$$


Eqns. 2.6 to 2.9 will be invoked often. So please commit them to memory. 
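
As a sketch of how Eqns. 2.6 to 2.9 look in practice, here is a small Python round trip between (length, angle) and components. One substitution of mine, not in the text: for the angle I use `math.atan2(ay, ax)` rather than a bare inverse tangent of the ratio, because $\tan^{-1}(A_y/A_x)$ by itself cannot tell a vector from its negative (the ratio is the same for both). The function names are hypothetical.

```python
import math


def components(length, theta):
    """Eqns. 2.6 and 2.7: components from length and angle (in radians)."""
    return length * math.cos(theta), length * math.sin(theta)


def length_and_angle(ax, ay):
    """Eqn. 2.8 for the length; atan2 picks the angle in the correct quadrant."""
    return math.hypot(ax, ay), math.atan2(ay, ax)


ax, ay = components(2.0, math.radians(30))
print(ax, ay)
print(length_and_angle(ax, ay))          # recovers length 2 and angle pi/6

# The vector (-1, -1) points into the third quadrant:
print(math.degrees(math.atan2(-1.0, -1.0)))     # quadrant-aware angle
print(math.degrees(math.atan(-1.0 / -1.0)))     # bare arctangent of the ratio
```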
If you give me a pair of numbers, $(A_x, A_y)$, that’s as good as giving 
me this arrow, because I can find the length of the arrow by Pythagoras’s 
theorem and I can find the orientation from $\tan\theta = A_y/A_x$. You have the option 
of either working with the two components of A or with the arrow. 
In practice, most of the time we work with these two numbers, (Ax, Ay). 
In particular, if we are describing a particle whose location is the position 
vector r, then we write it in terms of its components as 


r=ix+jy. (2.10) 


The changes in r are the displacement vectors and examples are A and B in 
Figure 2.2 that described the two hikes. 

I have not given you any other example of vectors besides the dis- 
placement vector, but at the moment, we’ll define a vector to be any object 
that looks like some multiple of i plus some multiple of j. If I tell you to 
add two vectors A and B, you have got two options. You can draw the ar- 
row corresponding to A and attach to its end an arrow corresponding to B, 
and then add them, as in Figure 2.2. But you can also do the bookkeeping 
without drawing any pictures as follows: 


$$\mathbf{A} + \mathbf{B} = \mathbf{i}A_x + \mathbf{j}A_y + \mathbf{i}B_x + \mathbf{j}B_y \tag{2.11}$$
$$= \mathbf{i}(A_x + B_x) + \mathbf{j}(A_y + B_y) \tag{2.12}$$


so that the sum C is the vector with components $(A_x + B_x, A_y + B_y)$. 

In the above, I have used the fact that vectors can be added in any 
order. So I grouped the things involving just i and likewise j. Then I argued 
that since $\mathbf{i}A_x$ and $\mathbf{i}B_x$ are vectors along i, their sum is a vector of length 
$A_x + B_x$ also along i. I did the same for j. 

In summary if 


$$\mathbf{A} + \mathbf{B} = \mathbf{C} \tag{2.13}$$
then 

$$C_x = A_x + B_x \tag{2.14}$$

$$C_y = A_y + B_y \tag{2.15}$$


which can be summarized as follows: 
To add two vectors, add their respective components. 

An important result is that A = B is possible only if $A_x = B_x$ and $A_y = B_y$. 
You cannot have two vectors equal without having exactly the same x 
component and exactly the same y component. If two arrows are equal, 
one cannot be longer in the x direction and correspondingly shorter in 
the y direction. Everything has to match completely. The vector equation 
A = B is actually a shorthand for two equations: $A_x = B_x$ and $A_y = B_y$. 
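
The bookkeeping in components is easy to mechanize. The following Python sketch is only an illustration: the `Vec2` type and the helper names `add` and `scale` are my own, not anything defined in this book. It checks that A + B = B + A, that 2A is the same as A + A, and that A + (−A) is the null vector.

```python
from typing import NamedTuple


class Vec2(NamedTuple):
    """A vector in d = 2, stored as its components along i and j."""
    x: float
    y: float


def add(a: Vec2, b: Vec2) -> Vec2:
    """Eqns. 2.14 and 2.15: add the respective components."""
    return Vec2(a.x + b.x, a.y + b.y)


def scale(c: float, a: Vec2) -> Vec2:
    """Scalar multiplication: rescale each component by the number c."""
    return Vec2(c * a.x, c * a.y)


A = Vec2(3.0, 1.0)     # illustrative values
B = Vec2(1.0, 2.0)
NULL = Vec2(0.0, 0.0)  # the null vector

print(add(A, B), add(B, A))                  # same vector either way
print(scale(2.0, A), add(A, A))              # 2A equals A + A
print(add(A, scale(-1.0, A)))                # A + (-A) gives the null vector
print(add(scale(2.6, A), scale(-7.3, B)))    # a general combination
```

Two `Vec2` values compare equal only when both components match, which is the statement that A = B is shorthand for two equations.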


2.4 Choice of axes and basis vectors 


I have in mind a vector whose components are 3 and 5. Can you draw 
the vector for me? If you immediately said, “It is 3i + 5j,” you’re making 
the assumption that I am writing the vector in terms of i and j. I agree 
i and j point along two natural directions. For most of us, given that the 
blackboard or notebook is oriented this way, it is very natural to line up 
our axes with it. But there is no reason why somebody else couldn’t come 
along and say, “I want to use a different set of axes. The x and y axes or 
i and j are not nailed in absolute space. They are human constructs and 
we’re not wedded to any of them.” 

Quite often, it’s natural to pick the axes in a certain way to suit the 
problem. If you are studying a cannon ball launched from the earth, it 
makes sense to pick the horizontal as the x-axis and the vertical as the y- 
axis, but, mathematically, you don’t have to. Another set of rotated but 
mutually perpendicular unit vectors i’ and j’ that form another basis can 
also be rescaled and added to form any given vector A in the plane. For 
example, when we study objects sliding down an inclined plane, we will 
choose our axes parallel and perpendicular to the incline. 

If I draw an arrow A on a blank sheet of paper, it has a life of its own 
without reference to any axes. The same vector A can be written either in 
terms of i and j, which is the old basis, or in terms of i’ and j’, the new basis. 
How do the components $(A_x', A_y')$ in the new basis relate to the components 
in the old basis? It’s a simple problem, but I just want to do it so you get 
used to working with vectors. 

For this we need the very busy Figure 2.4. It shows the old x and y axes 
and the x’ and y’ axes obtained by rotating the x-y axes counterclockwise 
by an angle $\phi$. The unit vectors i’ and j’ are likewise rotated versions of 
i and j. The components in the two bases are shown by dotted lines and 
are simply the projections of A along the various axes. We want to relate 
$(A_x', A_y')$ to $(A_x, A_y)$. 

Figure 2.4 The same vector A is written as $\mathbf{i}A_x + \mathbf{j}A_y$ in one frame and as 
$\mathbf{i}'A_x' + \mathbf{j}'A_y'$ in the other. The dotted lines indicate the components in the two 
frames. 


First we express i’ and j’ in terms of i and j using the figure: 


$$\mathbf{i}' = \mathbf{i}\cos\phi + \mathbf{j}\sin\phi \tag{2.16}$$
$$\mathbf{j}' = \mathbf{j}\cos\phi - \mathbf{i}\sin\phi. \tag{2.17}$$


Here are the details. The vector i’ has got a horizontal part, which is its 
length, namely, 1, times $\cos\phi$, and a vertical part that is 1 times $\sin\phi$. How 
about j’? It is at an angle $\phi$ relative to j. So its y component is $\cos\phi$. Finally, 
its x or horizontal component is $(-\sin\phi)$, where the minus sign comes 
because it is pointing to the left, along the negative x-axis. All that remains 
now is to eliminate i’ and j’ in favor of i and j in $\mathbf{A} = \mathbf{i}'A_x' + \mathbf{j}'A_y'$ and equate 
it to A written in terms of i and j: 


$$\mathbf{A} = \mathbf{i}'A_x' + \mathbf{j}'A_y' \tag{2.18}$$
$$= (\mathbf{i}\cos\phi + \mathbf{j}\sin\phi)A_x' + (\mathbf{j}\cos\phi - \mathbf{i}\sin\phi)A_y' \tag{2.19}$$
$$= \mathbf{i}(A_x'\cos\phi - A_y'\sin\phi) + \mathbf{j}(A_x'\sin\phi + A_y'\cos\phi) \tag{2.20}$$
$$= \mathbf{i}A_x + \mathbf{j}A_y. \tag{2.21}$$
When we equate the coefficients of i and j on the right-hand sides of Eqns. 
2.20 and 2.21, we obtain the desired expression for $A_x$ and $A_y$ in terms of 
$A_x'$ and $A_y'$: 


$$A_x = A_x'\cos\phi - A_y'\sin\phi \tag{2.22}$$
$$A_y = A_x'\sin\phi + A_y'\cos\phi. \tag{2.23}$$


So, you can pick your basis vectors any way you like and so can I. Your 
basis is obtained from mine by a counterclockwise rotation by an angle 
$\phi$. The same entity A, the same arrow which has an existence of its own, 
independent of axes, can be described by you and by me using different 
components. Your components with primes on them are related to mine 
by Eqns. 2.22 and 2.23. This is called the transformation law for the vector 
components under rotation of basis vectors. 

Now, you can ask the opposite question. How do I get $A_x'$ and $A_y'$ in 
terms of $A_x$ and $A_y$? The quickest way is to replace $\phi$ by $-\phi$: if we go from 
the unprimed to the primed system by a rotation $\phi$, then rotation by $-\phi$ 
is the way to go from the primed to the unprimed basis. The result, using 
$\cos(-\phi) = \cos\phi$ and $\sin(-\phi) = -\sin\phi$, is 


$$A_x' = A_x\cos\phi + A_y\sin\phi \tag{2.24}$$
$$A_y' = -A_x\sin\phi + A_y\cos\phi. \tag{2.25}$$
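
To see the two transformation laws undo each other, here is a minimal Python sketch. The function names are hypothetical, and the vector 3i + 5j and the angle of 60° are just sample inputs.

```python
import math


def to_unprimed(ax_p, ay_p, phi):
    """Eqns. 2.22 and 2.23: primed components -> unprimed components."""
    c, s = math.cos(phi), math.sin(phi)
    return ax_p * c - ay_p * s, ax_p * s + ay_p * c


def to_primed(ax, ay, phi):
    """Eqns. 2.24 and 2.25: unprimed components -> primed components."""
    c, s = math.cos(phi), math.sin(phi)
    return ax * c + ay * s, -ax * s + ay * c


phi = math.pi / 3                        # axes rotated counterclockwise by 60 degrees
ax_p, ay_p = to_primed(3.0, 5.0, phi)    # the vector 3i + 5j seen from the rotated axes
print(ax_p, ay_p)
print(to_unprimed(ax_p, ay_p, phi))      # back to (3, 5), up to rounding

# A vector along the old x-axis, seen from axes rotated by 90 degrees,
# lies along the negative y'-axis.
print(to_primed(1.0, 0.0, math.pi / 2))
```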


That turns out to be the correct answer. But I want you to think about 
another way to show this, which often seems to bother some students. If I 
told you 


3x+5y=21 (2.26) 
4x + 6y = 26, (2.27) 


you certainly know how to solve for x and y, right? You have got to juggle 
the two equations, multiply the first by 6, the second by 5, and subtract to 
isolate x and so on. Why is it when some of you see Eqns. 2.22 and 2.23, 
you don’t realize it’s the same kind of problem, where you can multiply 
Eqn. 2.22 by $\cos\phi$, Eqn. 2.23 by $\sin\phi$ and add to isolate $A_x'$, for example? 
For any particular value of $\phi$, $\sin\phi$ and $\cos\phi$ are just some numbers. For 
example, if I pick $\phi = \pi/3 = 60^\circ$, $\cos\phi = \frac{1}{2}$ and $\sin\phi = \frac{\sqrt{3}}{2}$. The equations 
become (for this angle) 


$$A_x = \frac{1}{2}A_x' - \frac{\sqrt{3}}{2}A_y' \tag{2.28}$$
$$A_y = \frac{\sqrt{3}}{2}A_x' + \frac{1}{2}A_y'. \tag{2.29}$$


If you multiply the second by $\sqrt{3}$ and add it to the first you obtain 


$$A_x + \sqrt{3}A_y = 2A_x', \quad\text{which means} \tag{2.30}$$
$$A_x' = \frac{1}{2}A_x + \frac{\sqrt{3}}{2}A_y \tag{2.31}$$
$$= A_x\cos\frac{\pi}{3} + A_y\sin\frac{\pi}{3} \tag{2.32}$$


in accordance with Eqn. 2.24. So go forth and treat $\sin\phi$ and $\cos\phi$ 
as plain numbers and juggle Eqns. 2.22 and 2.23 to derive Eqns. 2.24 
and 2.25. Along the way of course you will have to use identities like 
$\sin^2\phi + \cos^2\phi = 1$. 

The components of the vector depend on who is looking at the vector. 
However, there’s one quantity that’s going to come out the same, no matter 
who is looking at the vector. It is the length of the vector. It is unaffected 
by the rotation of axes. It is an invariant under rotations. You may verify 
from Eqns. 2.24 and 2.25 that 


$$(A_x')^2 + (A_y')^2 = (A_x\cos\phi + A_y\sin\phi)^2 + (-A_x\sin\phi + A_y\cos\phi)^2 \tag{2.33}$$
$$= A_x^2(\cos^2\phi + \sin^2\phi) + A_y^2(\sin^2\phi + \cos^2\phi) \tag{2.34}$$
$$= A_x^2 + A_y^2. \tag{2.35}$$


The $A_xA_y$ term is gone since its coefficient is $2(\cos\phi\sin\phi - \sin\phi\cos\phi) = 0$. 
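
A numerical spot check of this invariance, as a sketch: the components change with the angle, but the length does not. The helper `to_primed` is my own name for Eqns. 2.24 and 2.25, and the angles are arbitrary.

```python
import math


def to_primed(ax, ay, phi):
    """Eqns. 2.24 and 2.25: components seen from axes rotated by phi."""
    c, s = math.cos(phi), math.sin(phi)
    return ax * c + ay * s, -ax * s + ay * c


ax, ay = 3.0, 5.0
for degrees in (0, 30, 60, 137, 270):
    ax_p, ay_p = to_primed(ax, ay, math.radians(degrees))
    # Components differ from row to row; the last column stays the same.
    print(degrees, round(ax_p, 4), round(ay_p, 4), math.hypot(ax_p, ay_p))
```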

I want to conclude with one important point. We learned that a vec- 
tor is a quantity that has a magnitude and a direction. A more advanced 
view of vectors is that they are a pair of numbers (in d = 2) which, un- 
der rotation of axes, transform as per Eqns. 2.24 and 2.25. Anything that 
transforms this way is called a vector. We already know about the position 
vector r and the changes in it, the displacement vectors (used in describing 
the hike). How about more vectors? There turns out to be a very nice way 
to produce vectors, given one vector like the position vector. And that’s the 
following. 


2.5 Derivatives of the position vector r 


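Let’s take a particle in the x-y plane that moves from r at time t to $\mathbf{r} + \Delta\mathbf{r}$ 
at time $t + \Delta t$ as in Figure 2.5. At time t its location is 


$$\mathbf{r} = \mathbf{i}x(t) + \mathbf{j}y(t) \tag{2.36}$$
and at $t + \Delta t$ it is 


$$\mathbf{r} + \Delta\mathbf{r} = \mathbf{i}(x(t) + \Delta x) + \mathbf{j}(y(t) + \Delta y) \quad\text{so that} \tag{2.37}$$
$$\Delta\mathbf{r} = \mathbf{i}\Delta x + \mathbf{j}\Delta y \quad\text{and by the usual limiting process,} \tag{2.38}$$
$$\mathbf{v} = \lim_{\Delta t \to 0}\frac{\Delta\mathbf{r}}{\Delta t} = \frac{d\mathbf{r}}{dt} = \mathbf{i}\frac{dx}{dt} + \mathbf{j}\frac{dy}{dt}. \tag{2.39}$$


Figure 2.5 The particle moving along some curved path goes from r at time t to 
$\mathbf{r} + \Delta\mathbf{r}$ at time $t + \Delta t$. The velocity v is the limit of the ratio $\Delta\mathbf{r}/\Delta t$ as $\Delta t \to 0$, and 
thus parallel to $\Delta\mathbf{r}$, which eventually becomes tangent to the curve. 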
When you move just along the x-axis, you wait a small time $\Delta t$ and 
you move by an amount $\Delta x$, and their ratio gives the velocity in the appropriate 
limit. When you move in the plane, your position and its change 
are both vectors. 

Can you see why the derivative of a vector is also a vector? Because 
$\Delta\mathbf{r}$, the difference in the vector between two times, is itself a vector. Dividing 
it by $\Delta t$ is like multiplying by $1/\Delta t$, but we know that when we 
multiply a vector by a number, we simply rescale the vector. So the limit 
will be some arrow that we call the instantaneous velocity vector. It will be 
tangential to the curve r(t) and point toward the instantaneous direction 
of travel. 

If I gave you the location of a particle as a function of time, you 
can find its velocity by taking derivatives. For example, if I say a particle’s 
location is 


$$\mathbf{r} = t^2\,\mathbf{i} + 9t^3\,\mathbf{j} \tag{2.40}$$
then its velocity at time t is 
$$\mathbf{v} = 2t\,\mathbf{i} + 27t^2\,\mathbf{j}. \tag{2.41}$$


You can take a derivative of the velocity or the second derivative of 
the position to get the acceleration vector 


$$\mathbf{a}(t) = \frac{d\mathbf{v}}{dt} = \frac{d^2\mathbf{r}}{dt^2} = 2\,\mathbf{i} + 54t\,\mathbf{j} \quad\text{(in our example).} \tag{2.42}$$

You can then also multiply a by the mass m, which is a scalar un- 
affected by rotations, to get a vector ma, which Newton’s law equates to 
another vector, the force F. 
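
You can mimic the limiting process on a computer by using a small but finite time step. The sketch below assumes the example position is $\mathbf{r} = t^2\,\mathbf{i} + 9t^3\,\mathbf{j}$ (the form consistent with the velocity and acceleration quoted for the example) and compares finite-difference estimates with 2t i + 27t² j and 2 i + 54t j. The step sizes are arbitrary choices of mine.

```python
def r(t):
    """Position components (x, y) of the example particle."""
    return (t**2, 9 * t**3)


def velocity(t, h=1e-5):
    """Central-difference estimate of dr/dt, component by component."""
    (xp, yp), (xm, ym) = r(t + h), r(t - h)
    return ((xp - xm) / (2 * h), (yp - ym) / (2 * h))


def acceleration(t, h=1e-3):
    """Second-difference estimate of d2r/dt2, component by component."""
    (xp, yp), (x0, y0), (xm, ym) = r(t + h), r(t), r(t - h)
    return ((xp - 2 * x0 + xm) / h**2, (yp - 2 * y0 + ym) / h**2)


t = 2.0
print(velocity(t), (2 * t, 27 * t**2))      # numerical vs. exact velocity
print(acceleration(t), (2.0, 54 * t))       # numerical vs. exact acceleration
```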

Even though we started with one example of a vector r, we’re now 
finding out that its derivative has to be a vector and the derivative of the 
derivative is also a vector. When you learn relativity, you will find out 
there’s again one vector that’s staring at you, the analog of the position vec- 
tor, but with four components. But more vectors can be manufactured by 
multiplying vectors by scalars (like mass) or taking derivatives with respect 
to a parameter that plays the role analogous to time. 

Here is an illustration of vector addition and differentiation. Imagine 
an airplane in flight, as depicted in Figure 2.6. Let $\mathbf{r}_{pg}$ be the location of a 
fixed point in the airplane, say the tail, with respect to a fixed point on the 
ground. Imagine that in the airplane there is a ball located at $\mathbf{r}_{bp}$ as measured 
from this fixed point in the airplane. By vector addition the location 
of the ball with respect to the ground is 


$$\mathbf{r}_{bg} = \mathbf{r}_{bp} + \mathbf{r}_{pg}. \tag{2.43}$$

Figure 2.6 The position of the ball relative to (some origin on) the ground, $\mathbf{r}_{bg}$, is 
the vector sum of the position of the ball relative to the (tail of the) plane, $\mathbf{r}_{bp}$, 
and the position of (the tail of) the plane relative to the ground, $\mathbf{r}_{pg}$. 


Upon taking a time derivative and in the same notation, the law of 
composition of velocities follows: 


$$\mathbf{v}_{bg} = \mathbf{v}_{bp} + \mathbf{v}_{pg}, \tag{2.44}$$
which says the velocity of the ball as seen by a person on the ground is 
the velocity of the ball relative to the airplane plus the velocity of the air- 
plane relative to the ground. Taking yet another derivative we may relate 
the accelerations: 


$$\mathbf{a}_{bg} = \mathbf{a}_{bp} + \mathbf{a}_{pg}. \tag{2.45}$$


In the special case of an airplane moving at constant velocity, $\mathbf{a}_{pg} = 0$. Then 
we find 


$$\mathbf{a}_{bg} = \mathbf{a}_{bp}, \tag{2.46}$$
which means, in this case, the acceleration of the ball is the same as measured 
by an observer on the ground and an observer on the airplane. These 
results will be recalled in our study of relativity. 
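
Here is a small Python sketch of Eqns. 2.43 to 2.46. Everything numerical in it is hypothetical: a plane flying level at a constant 200 m/s and a ball that, as seen from the plane, drifts forward while falling with g = 9.8 m/s². The derivatives are estimated by finite differences.

```python
G = 9.8  # m/s^2


def r_pg(t):
    """Plane (tail) relative to ground: constant velocity, made-up numbers."""
    return (200.0 * t, 10000.0)


def r_bp(t):
    """Ball relative to the plane's tail: drifting forward and falling."""
    return (1.5 * t, -0.5 * G * t**2)


def r_bg(t):
    """Eqn. 2.43: ball relative to ground, by vector addition."""
    (xb, yb), (xp, yp) = r_bp(t), r_pg(t)
    return (xb + xp, yb + yp)


def first_derivative(f, t, h=1e-4):
    """Central-difference estimate of df/dt for a vector-valued f."""
    return tuple((p - m) / (2 * h) for p, m in zip(f(t + h), f(t - h)))


def second_derivative(f, t, h=1e-2):
    """Second-difference estimate of d2f/dt2 for a vector-valued f."""
    return tuple((p - 2 * c + m) / h**2
                 for p, c, m in zip(f(t + h), f(t), f(t - h)))


t = 1.0
print(first_derivative(r_bg, t))                                  # velocity seen from the ground
print(first_derivative(r_bp, t), first_derivative(r_pg, t))       # the two pieces that add up to it
print(second_derivative(r_bg, t), second_derivative(r_bp, t))     # equal accelerations
```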


2.6 Application to circular motion 


Now we’ll take a concrete problem where you will see how to take derivatives 
to obtain very useful results. I’m going to write a particular case of 
r(t): 


$$\mathbf{r}(t) = R(\mathbf{i}\cos\omega t + \mathbf{j}\sin\omega t) \tag{2.47}$$


where R and $\omega$ are constants. What is going on as a function of time? 
What’s this particle doing? Look at the length squared of this vector: 


$$x^2 + y^2 = R^2(\cos^2\omega t + \sin^2\omega t) = R^2. \tag{2.48}$$


That means the particle is going around in a circle of radius R as shown in 
Figure 2.7. The x component is $R\cos\omega t$ and the y component is $R\sin\omega t$ 
where $\omega$ is a fixed number. As t increases, this angle $\omega t$ increases and the 
particle goes round and round. Let’s get a feeling for $\omega$. As time increases, 
the angle increases and we can ask how long it will take the particle to come 
back to the starting point. Suppose the starting point was on the x-axis. As 
t increases, $\omega t$ increases, and the particle will come back at a time T such 
that 


$$\omega T = 2\pi. \tag{2.49}$$


Figure 2.7 The particle moves along a circle of radius R with an angular 
velocity $\omega$. The velocities marked on the circle are $\mathbf{j}\,\omega R$ at $\omega t = 0$, 
$-\mathbf{i}\,\omega R$ at $\omega t = \pi/2$, $-\mathbf{j}\,\omega R$ at $\omega t = \pi$, and $\mathbf{i}\,\omega R$ at $\omega t = 3\pi/2$. 


Thus $\omega$ is related to the time period T by 


$$\omega = \frac{2\pi}{T} = 2\pi f \tag{2.50}$$


where $f = \frac{1}{T}$ is the frequency or number of cycles per second. It is measured 
in Hz, which stands for Hertz. Since in every cycle the particle rotates 
by $2\pi$, and it completes f revolutions per second, $\omega = 2\pi f$ is called the 
angular velocity and measures the radians swept out per second. 

Notice that in equating a full cycle to $2\pi$ I am using radians and not 
degrees to measure angles. For those who have not seen a radian, it’s just 
another way to measure angles, wherein a full circle, which we used to 
think was worth 360°, now equals $2\pi$ radians. Since $2\pi \approx 6.3$, a radian is 
roughly 60° (more precisely, about 57.3°). You will see the advantages of using 
radians later. For now just remember that a half circle, instead of being 180°, 
will now be $\pi$ radians, and a quarter circle will be $\pi/2$, and so forth. 

How fast is this particle moving? It’s going around a circle, the angle 
is increasing at a steady rate $\omega$, and so we know it’s going at a steady speed. 
Let us verify that by computing the velocity 


$$\mathbf{v}(t) = \frac{d\mathbf{r}(t)}{dt} \tag{2.51}$$
$$= R\left(\mathbf{i}\frac{d\cos\omega t}{dt} + \mathbf{j}\frac{d\sin\omega t}{dt}\right) \tag{2.52}$$
$$= R\omega(-\mathbf{i}\sin\omega t + \mathbf{j}\cos\omega t). \tag{2.53}$$


At t = 0, the velocity is $\mathbf{v} = R\omega\cos 0\,\mathbf{j} = R\omega\,\mathbf{j}$, so it is moving straight up 
at speed $v = \omega R$. You may verify that it has the velocity as shown in the 
figure at later times. The magnitude of the velocity is always $\omega R$ although 
the direction is changing. From the figure we see it remains tangential to 
the circle. The constancy of the speed v at an arbitrary time may also be 
established by computing 


$$v^2 = (\omega R)^2(\sin^2\omega t + \cos^2\omega t) = (\omega R)^2 \tag{2.54}$$
$$v = \omega R. \tag{2.55}$$


Remember the tangential velocity is $v = \omega R$. 
Let’s take the derivative of the derivative to find the acceleration a and 
its magnitude: 


$$\mathbf{a} = -\omega^2 R(\mathbf{i}\cos\omega t + \mathbf{j}\sin\omega t) = -\omega^2\mathbf{r} \tag{2.56}$$
$$a = \omega^2 R. \tag{2.57}$$


That’s a very important result. It tells you that when a particle moves 
in a circle of radius R at constant speed v, it has an acceleration, called the 
centripetal acceleration, directed toward the center and of magnitude 


$$a = \omega^2 R = \frac{(\omega R)^2}{R} = \frac{v^2}{R}. \tag{2.58}$$


This acceleration at constant speed reflects the fact that velocity is a vector 
and you can change the velocity vector by changing its direction. For example, 
if a car is going on a racetrack and the speedometer says 60 miles 
per hour, the lay person’s view is that the car is not accelerating. But you 
will say from now on that it indeed has an acceleration equal to $v^2/R$ even 
though no one’s stepping on the accelerator or the brake. 

Suppose the particle is not moving fully around a circle but traversing 
just a quarter of the circle. When it is traveling the quarter of a circle, it 
has the same acceleration directed toward the center of that quarter circle. 
In other words, you don’t have to be moving actually in a circle to have the 
acceleration $v^2/R$. At any instant, the curve you are following can be locally 
approximated as part of some circle, and, in the formula $a = v^2/R$, the acceleration 
is directed toward the center of that circle, R is its radius and v the 
instantaneous tangential velocity. 
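
The results of this section can be spot-checked numerically. In the sketch below, R = 2 and $\omega$ = 3 are illustrative values I picked; the functions simply encode Eqns. 2.47, 2.53, and 2.56. At each time it prints the speed, the magnitude of the acceleration, $v^2/R$, and the dot product of r with v (zero when v is tangential).

```python
import math

R, OMEGA = 2.0, 3.0   # illustrative radius (m) and angular velocity (rad/s)


def r(t):
    """Eqn. 2.47: position on the circle."""
    return (R * math.cos(OMEGA * t), R * math.sin(OMEGA * t))


def v(t):
    """Eqn. 2.53: velocity."""
    return (-R * OMEGA * math.sin(OMEGA * t), R * OMEGA * math.cos(OMEGA * t))


def a(t):
    """Eqn. 2.56: acceleration, equal to -omega^2 times r."""
    return (-OMEGA**2 * R * math.cos(OMEGA * t), -OMEGA**2 * R * math.sin(OMEGA * t))


for t in (0.0, 0.4, 1.3):
    speed = math.hypot(*v(t))
    accel = math.hypot(*a(t))
    r_dot_v = r(t)[0] * v(t)[0] + r(t)[1] * v(t)[1]
    print(t, speed, accel, speed**2 / R, r_dot_v)
```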
2.7 Projectile motion 


I want to consider a particle for which $\mathbf{r}_0$ and $\mathbf{v}_0$ are the position and 
velocity at t = 0 and which has a constant vector acceleration a. What 
is its location at all future times? By analogy with what I did in one 
dimension, 


$$\mathbf{r}(t) = \mathbf{r}_0 + \mathbf{v}_0 t + \frac{1}{2}\mathbf{a}t^2. \tag{2.59}$$


Once you know $\mathbf{r}_0$ and $\mathbf{v}_0$, you can find the position of the object at all 
future times. Let’s take one simple example. Somebody in a car has decided 
to drive off a cliff as shown in Figure 2.8(a). We want to know when and 
where the car hits the ground. 

We pick our origin (0,0) at the foot of the cliff. Let the height of 
the cliff be h. The car is traveling with some initial speed $v_{0x}$ in the horizontal 
direction. Equation 2.59 is really a pair of equations, one along x 
and one along y with $\mathbf{a} = -\mathbf{j}g$, $\mathbf{v}_0 = v_{0x}\mathbf{i}$, and $\mathbf{r}_0 = h\mathbf{j}$. Separating out the 
components 


$$x(t) = 0 + v_{0x}t + 0 \tag{2.60}$$
$$y(t) = h + 0 - \frac{1}{2}gt^2. \tag{2.61}$$

Notice that the evolution of the two coordinates is completely independent. 
The time t* when the car hits the ground (y = 0) satisfies the 
equation 


$$0 = h - \frac{1}{2}g\,(t^*)^2 \tag{2.62}$$
$$t^* = \sqrt{\frac{2h}{g}}. \tag{2.63}$$


Figure 2.8 (a) A car flies off the cliff at (0, h) and lands at (d, 0). (b) A projectile 
is launched with initial velocity $\mathbf{v}_0 = (\mathbf{i} + \sqrt{3}\,\mathbf{j})$ m/s. The range is R = 0.35 m and 
the maximum height reached is $y_{max} = 0.15$ m. 


This is exactly how long it would take to hit the ground had it simply top- 
pled over the edge from rest. The horizontal velocity does not delay the 
crash one bit (unless you take into account the curvature of the earth). As 
to where the car lands, the location is given by (x(t*),0) = (d,0) where 


$$d = v_{0x}t^* = v_{0x}\sqrt{\frac{2h}{g}}. \tag{2.64}$$
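
With numbers plugged in, the car problem takes only a few lines. In this sketch the cliff height of 20 m, the speed of 15 m/s, and g = 9.8 m/s² are values I made up, and the function names are hypothetical.

```python
import math

G = 9.8  # m/s^2


def fall_time(h):
    """Eqn. 2.63: time to fall a height h, whatever the horizontal speed."""
    return math.sqrt(2 * h / G)


def landing_distance(h, v0x):
    """Eqn. 2.64: horizontal distance covered during the fall."""
    return v0x * fall_time(h)


h, v0x = 20.0, 15.0                      # illustrative cliff height (m) and speed (m/s)
t_star = fall_time(h)
print(t_star, landing_distance(h, v0x))
print(h - 0.5 * G * t_star**2)           # y(t*) from Eqn. 2.61: zero up to rounding
```

Doubling $v_{0x}$ doubles d but leaves t* untouched, which is the independence of the two coordinates at work.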


Finally the problem of projectile motion is depicted in Figure 2.8(b). 
You fire a projectile from (0,0) with some velocity $\mathbf{v}_0$ at some angle $\theta$. It 
will go up and then come down, moving horizontally at the same time. 
Where is it going to land? What is the maximum height $y_{max}$ to which it 
rises? With what speed will it hit the ground? At what angle should you fire 
your projectile so it will go the furthest? 

Here are the equations that contain all the answers, namely Eqn. 2.59 
written out in component form: 


$$x(t) = 0 + v_{0x}t = v_0\cos\theta \cdot t \tag{2.65}$$
$$y(t) = 0 + v_{0y}t - \frac{1}{2}gt^2 = v_0\sin\theta \cdot t - \frac{1}{2}gt^2. \tag{2.66}$$


You can solve them but it is good to have an idea of what’s coming. 
Imagine you have this monster cannon to fire things. It has a fixed muzzle 
speed, vo, but allows you to fire at any angle. How do you aim it so the ball 
goes as far as possible? There are two schools of thought. One says, aim at 
your enemy and fire horizontally. Then the ball lands on your foot because 
it has zero time of flight (assuming the cannon is at zero height). The other 
school says, maximize the time of flight and point the cannon vertically. It 
goes up, stays in the air for a very long time, and lands on your head. Then 
it hits you: the correct answer is somewhere between 0 and $90^\circ = \pi/2$. The 
naive guess $45^\circ = \pi/4$ turns out to be correct. 
Now I want to show you how to use the equations to prove this. 
What’s the strategy for finding the range? You see how long the ball is in the 
air and multiply that by the constant horizontal velocity $v_0\cos\theta$. Again, let 
t* be the time when it hits the ground. The y-equation 


$$0 = t^*\left(v_0\sin\theta - \frac{1}{2}gt^*\right) \tag{2.67}$$


has two solutions: 


$$t^* = 0 \quad\text{or}\quad t^* = \frac{2v_0\sin\theta}{g}. \tag{2.68}$$
So the cannon ball is on the ground on two occasions. One is initially. 
We are not interested in that trivial solution. If the time you are interested 
in is $t^* \neq 0$, you’re allowed to divide both sides of Eqn. 2.67 by it and get 


$$t^* = \frac{2v_0\sin\theta}{g} \tag{2.69}$$

and the range 


$$R = x(t^*) = v_0\cos\theta \cdot t^* = \frac{2v_0^2\sin\theta\cos\theta}{g} = \frac{v_0^2\sin 2\theta}{g} \tag{2.70}$$


using $\sin 2\theta = 2\sin\theta\cos\theta$. For the greatest range we must make $\sin 2\theta$ as 
large as possible, which occurs for $2\theta = \pi/2$ or $\theta = \pi/4 = 45^\circ$. For any smaller 
range you will find there are two possible angles that work since $\sin(2\theta) = 
\sin(\pi - 2\theta)$. 
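
If you don’t trust the trigonometry, you can let a computer try every whole-degree angle. This is a brute-force sketch, with g = 9.8 m/s² assumed and $v_0$ = 2 m/s chosen to match the projectile of Figure 2.8(b), whose velocity (i + √3 j) m/s has magnitude 2 m/s at 60°. The function names are mine.

```python
import math

G = 9.8  # m/s^2


def flight_time(v0, theta):
    """Eqn. 2.69: the nonzero time at which y returns to 0."""
    return 2 * v0 * math.sin(theta) / G


def horizontal_range(v0, theta):
    """Eqn. 2.70: horizontal speed times time of flight."""
    return v0 * math.cos(theta) * flight_time(v0, theta)


v0 = 2.0
best = max(range(0, 91), key=lambda deg: horizontal_range(v0, math.radians(deg)))
print(best)                                       # best whole-degree launch angle
print(horizontal_range(v0, math.radians(60)))     # range of the Figure 2.8(b) projectile
print(horizontal_range(v0, math.radians(30)))     # the complementary angle gives the same range
```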

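The maximum height is the y-coordinate at the halfway time $\frac{1}{2}t^* = \frac{v_0\sin\theta}{g}$: 


$$y_{max} = v_{0y}\frac{t^*}{2} - \frac{1}{2}g\left(\frac{t^*}{2}\right)^2$$
$$= (v_0\sin\theta)\frac{v_0\sin\theta}{g} - \frac{1}{2}g\left[\frac{v_0\sin\theta}{g}\right]^2 = \frac{v_0^2\sin^2\theta}{2g}. \tag{2.71}$$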
We could equally well find the half-time $\frac{1}{2}t^*$ by setting the vertical velocity 
to zero: 


$$0 = v_0\sin\theta - g\frac{t^*}{2}. \tag{2.72}$$


How about the velocity at impact? The horizontal part is of course 
$v_0\cos\theta$ since there is no acceleration in that direction. The vertical part 
starts out at $v_0\sin\theta$ and decreases at a rate g: 


$$v_y = v_0\sin\theta - gt \tag{2.73}$$


so that at t* it has a value 


$$v_y(t^*) = v_0\sin\theta - g\frac{2v_0\sin\theta}{g} = -v_0\sin\theta, \tag{2.74}$$


which is just the opposite of the initial vertical speed. The magnitude of 
the final velocity is the same as the initial one since reversing one of the 
components of v does not change its length. 
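
Here is a short numerical sketch of the maximum height and the impact velocity, again for a projectile launched at 2 m/s and 60° (the Figure 2.8(b) values) with g = 9.8 m/s² assumed. The helper names are hypothetical.

```python
import math

G = 9.8  # m/s^2


def y(t, v0, theta):
    """Eqn. 2.66: height at time t."""
    return v0 * math.sin(theta) * t - 0.5 * G * t**2


def v_y(t, v0, theta):
    """Eqn. 2.73: vertical velocity at time t."""
    return v0 * math.sin(theta) - G * t


v0, theta = 2.0, math.radians(60)
t_star = 2 * v0 * math.sin(theta) / G                # Eqn. 2.69
y_max = (v0 * math.sin(theta))**2 / (2 * G)          # Eqn. 2.71

print(y(t_star / 2, v0, theta), y_max)               # height at the halfway time vs. the formula
print(v_y(t_star / 2, v0, theta))                    # vertical velocity vanishes at the top
print(v_y(t_star, v0, theta), -v0 * math.sin(theta)) # reversed vertical velocity at impact
print(math.hypot(v0 * math.cos(theta), v_y(t_star, v0, theta)), v0)  # same speed as at launch
```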

There are endless variations. You pick some point off the ground at 
(X,Y) and want the projectile to arrive there. You are given the launch 
angle $\theta$ and have to find the launch speed $v_0$. How do you do that problem? 
You assume the projectile arrives at the destination (X, Y) at some time t*. 
You go to the x equation and demand that x(t*) = X and solve for t*. Plug 
this time into y and demand y(t*) = Y and solve for vo. 
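
Carrying out that recipe: $x(t^*) = X$ gives $t^* = X/(v_0\cos\theta)$, and putting this into the y equation gives $Y = X\tan\theta - gX^2/(2v_0^2\cos^2\theta)$, so that $v_0^2 = gX^2/[2\cos^2\theta\,(X\tan\theta - Y)]$. The Python sketch below is my own working of the steps, not a formula quoted from elsewhere in the book. It assumes a launch from (0, 0), X > 0, an angle between 0 and 90°, and g = 9.8 m/s²; the function name and the target (10 m, 2 m) are made up.

```python
import math

G = 9.8  # m/s^2


def launch_speed(X, Y, theta):
    """Speed needed to pass through (X, Y) when fired from (0, 0) at angle theta."""
    # How far the target lies below the straight line of fire at horizontal distance X.
    drop = X * math.tan(theta) - Y
    if X <= 0 or drop <= 0:
        raise ValueError("target cannot be reached at this launch angle")
    return math.sqrt(G * X**2 / (2 * math.cos(theta)**2 * drop))


X, Y, theta = 10.0, 2.0, math.radians(45)     # illustrative target and angle
v0 = launch_speed(X, Y, theta)
t_star = X / (v0 * math.cos(theta))           # arrival time from the x equation
y_at_arrival = v0 * math.sin(theta) * t_star - 0.5 * G * t_star**2
print(v0, t_star, y_at_arrival)               # y_at_arrival should reproduce Y
```

The check on `drop` reflects the physics: gravity only pulls the projectile below its initial line of fire, so a target on or above that line cannot be hit at the given angle, no matter how fast you fire.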

Sometimes the problems are embellished to make everyone feel in- 
volved. For example, instead of a ball dropping, nowadays there’s a monkey 
or horse that is falling down, so people in life sciences can say, “Hey, we 
should learn physics since it seems to have applications to our subject.” All 
those gyrating creatures are very interesting and look great in color, but, in 
the end, you are told, “Treat the horse as a point particle.” If you're going to 
treat the horse as a point particle, why include its color picture? But I agree 
there are times when only a horse will do. When the Godfather wants to get 
the contract for Johnny Fontaine, he doesn’t tell his consigliere, “Hey, Tom, 
put half a point particle on Jack’s bed.” That would have been a disastrous 
approximation. 

By the way, don’t forget to treat the falling car in Figure 2.8 as a point 
particle. 


CHAPTER 3 


Newton’s Laws I 


3.1 Introduction to Newton’s laws of motion 


This is a big day in your life: you are going to learn Newton’s laws, in terms 
of which you can understand and explain a very large number of phenom- 
ena. It’s really amazing that so much information can be condensed into 
three laws. 

Your reaction may be that you have already seen Newton’s laws, that 
you have applied them in school. I realized fairly late in life that they are 
more subtle than I first imagined. It’s one thing to plug in all the numbers 
and say, “I know Newton’s laws and I know how they work.” But as you get 
older and you have more spare time, you think about what you are doing. 
This is something I have had the luxury of doing, and I have realized the 
laws are more tricky. I want to share some of that understanding. 

The first law, or the law of inertia, says, “If a body has no forces acting 
on it, it will maintain its velocity.” In other words, in the absence of external 
forces, a body at rest will remain at rest and a body in motion will retain its 
velocity. It is not surprising that a body at rest will remain so if not acted 
upon by a force. We see that all the time. I place an eraser on the table. It 
will stay put unless I do something to it. 

The great discovery that Galileo and Newton made was that you 
don’t need a force for a body to move at constant velocity. You don’t 
see that in daily life—everything seems to come to rest unless you keep 
pushing or pulling it. But we all know that the reason things come to a 
halt is that there is always some friction or drag bringing them to rest. 
If you take a hockey puck on an air cushion and give it a push, it seems
it can travel for a very long time. Galileo and Newton abstracted from this
an ideal situation in which friction was totally eliminated and the bodies 
kept moving forever with no help. If you go to outer space, you can check 
for yourself that if you throw something, it will go on forever without your 
intervention. It’s in the nature of things to retain a constant velocity. It is 
not velocity, but a change in velocity, that calls for a force. 

The law of inertia is not valid for everybody. I’ll give an example
from your own life. You go on an airplane and then, after the usual de- 
lays, the plane begins to accelerate down the runway. At that time, if you 
leave anything on the floor, you know it’s no longer yours. It’s going to 
slide backward and the physicist in the last row is going to collect every- 
thing. That is an example of a frame in which the law of inertia does not 
work: bodies accelerate with no applied force. Once the plane stops accelerating,
the law of inertia becomes operative. It fails again when the plane decelerates
during landing, when everything now slides to the front.

If Newton’s law of inertia works for you, you are called an inertial 
observer and your frame of reference is called an inertial frame. The plane 
that’s taking off is not an inertial frame, but the one that is cruising is. 
The earth seems to be a pretty good inertial frame, because if you leave 
something at some place, it just stays there—unless the thing is your cell 
phone and the place is Grand Central Station. But this is not a violation of 
Newton’s laws, just the laws of New York City. 

Although not every frame is inertial, there are plenty of inertial 
frames to go around. If you find even one inertial observer, namely one 
person for whom this law of inertia works, I can find for you an infinite 
number of other people for whom this is true. Who are these people? They 
are people moving at constant velocity with respect to the first inertial ob- 
server. Suppose you are in a train and you're moving past me with velocity 
u, and we both look at some object with no forces on it. We will not agree 
on its velocity or the velocity of anything: things at rest for me will be mov- 
ing backward for you at velocity —u, and everything at rest in your train 
will be moving at velocity u according to me. In short, you and I will differ 
on the velocity of any object by our relative velocity. But we will agree on 
the acceleration of any object since it is unaffected by adding a constant to 
velocity. Adding a constant velocity to objects does not change the fact that 
those which were maintaining constant velocity still maintain a constant 
(but different) velocity. Thus neither is every observer inertial, nor is an 
inertial observer unique.

You must know the earth is not precisely inertial. The earth has an
acceleration. Can you see why? Yes, it’s spinning around itself and going
around the sun, both of which constitute accelerated motion. But the
acceleration due to motion around the sun at speed v and radius r is
$a = v^2/r = 0.006\ \mathrm{m\,s^{-2}}$, which is a very small number, say compared to g.
The same goes for the acceleration due to the earth’s rotation about its
own axis, which is roughly $0.03\ \mathrm{m\,s^{-2}}$ or roughly g/300 at the equator.
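
You can check these two numbers yourself. The sketch below uses rounded values for the orbit radius, the earth's radius, the year, and the day; these inputs are my assumptions for illustration and are not quoted in the lecture.

```python
import math

# Rounded reference values (assumptions for this sketch, not from the text)
ORBIT_RADIUS = 1.5e11   # mean earth-sun distance in m
YEAR = 3.156e7          # one year in s
EARTH_RADIUS = 6.4e6    # equatorial radius in m
DAY = 8.64e4            # one day in s
G_SURFACE = 9.8         # m/s^2

def centripetal_acceleration(radius, period):
    """a = v^2 / r for uniform circular motion, with v = 2 pi r / period."""
    speed = 2 * math.pi * radius / period
    return speed**2 / radius

a_orbit = centripetal_acceleration(ORBIT_RADIUS, YEAR)
a_spin = centripetal_acceleration(EARTH_RADIUS, DAY)
print(a_orbit, a_spin, G_SURFACE / a_spin)
```

Both accelerations come out tiny compared to g, which is why treating the earth as an inertial frame is a good approximation for most problems.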

The first law might seem tautological because we never see anything 
that retains its velocity forever, and every time we see velocity change we 
say that a force is acting. But it’s not a big hoax, because you can set up ex- 
periments in free space, far from everything, where objects will, in fact, 
maintain their velocity forever. It’s a useful concept even on the earth,
because the earth is approximately inertial.


3.2 Newton’s second law 


Newton’s second law says, “If a body has an acceleration $\mathbf{a}$, then you need
a force


$$\mathbf{F} = m\mathbf{a} \tag{3.1}$$


to produce that acceleration.” 
In this chapter we will focus on one dimension and write 


$$F = ma \tag{3.2}$$


where F and a are along the x-axis. 

A few words about units. Acceleration is measured in $\mathrm{m\,s^{-2}}$. Mass is
measured in kilograms or kg. So force has units kilogram meters per sec- 
ond squared. But we get tired of saying that long expression, so we call that 
a Newton, denoted by N. If you had invented mechanics, we'd be calling it 
by your name, but it is too late for you now. 

Here is a typical problem that you may have solved in your first pass 
at Newton’s laws. A force of 36 N is acting on a mass of 4 kg. What’s the acceleration?
You divide 36 by 4 and you find it is $a = 9\ \mathrm{m\,s^{-2}}$. You say, “Okay, I
know Newton’s laws.” 

It’s actually more complicated than that. Take yourself back to the 
seventeenth century, when Newton was inventing these laws. You have an 
intuitive definition of force: when somebody pushes or pulls an object we
say a force is acting on it. Suddenly, you are told there is a law F = ma.
Are you better off in any way? Can you do anything with this law? What 
does it help you predict? Can you even tell if it’s true? Here’s a body that’s 
moving. Is Newton right? How are we going to check? Well, you want to 
measure the left-hand side and you want to measure the right-hand side. 
If they’re equal, you will say the law is working. What can you measure in 
this equation? 

Let’s start with acceleration. What’s your plan for measuring accel- 
eration? What instruments will you need? If you say a watch and ruler, 
that would be correct provided that by ruler you don’t mean Queen Elizabeth.
Here is your ruler and here is a Rolex. Tell me exactly how you plan to 
measure acceleration. Everyone seems to know the answer. First, let it go 
a little distance, and take the distance over time. That gives you the veloc- 
ity now. Let it go a little more, and repeat the velocity measurement. Take 
the difference of the two velocities and divide by the difference of the two 
times, and you have got the acceleration. Since the body has moved a fi- 
nite distance in a finite time, this gives the average acceleration. You want 
to make these three positional measurements more and more rapidly. In 
the end, as all the time intervals shrink to 0, you will measure what you 
can say is the acceleration now, the second derivative $\frac{d^2x}{dt^2} = a(t)$ defined in
calculus. 
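
Here is a minimal sketch of that measurement procedure in Python. The test motion $x = t^3$, for which the exact acceleration is $6t$, is a hypothetical choice of mine so that the answer is known in advance; the function names are likewise illustrative.

```python
def average_acceleration(x0, x1, x2, dt):
    """Estimate acceleration from three positions taken at times t, t + dt, t + 2 dt."""
    v_first = (x1 - x0) / dt    # average velocity over the first interval
    v_second = (x2 - x1) / dt   # average velocity over the second interval
    # The two average velocities belong to the interval midpoints, which are dt apart.
    return (v_second - v_first) / dt

def position(t):
    # Hypothetical test motion x = t^3, whose exact acceleration is 6 t.
    return t**3

t = 1.0  # the instant at which we want the acceleration (exact answer: 6)
for dt in (1.0, 0.1, 0.01):
    estimate = average_acceleration(position(t), position(t + dt), position(t + 2 * dt), dt)
    print(dt, estimate)
```

As the three measurements are squeezed closer together, the average acceleration approaches the instantaneous value at the starting instant.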

Back to testing if what Newton told you is right: You see an object in 
motion, you measure a, and you get a certain numerical value, say $10\ \mathrm{m\,s^{-2}}$.
But that’s not yet testing the equation, because you still have to find F and 
m. What’s the mass of this object? One common idea is to take a standard 
mass and balance the unknown mass on a seesaw by adjusting its position. 
But suppose you were in outer space. There’s no gravity. Then the seesaw 
will balance even if you put a potato on one side and an elephant on the 
other side. What you are doing now is appealing to the notion of mass as 
something that’s related to the pull of the earth on the object. You have got 
to go back and wipe out everything you know. If F = ma is all you have, 
there is no mention of the earth in these equations. You only know how to 
measure a, but not the other two. So you have a problem. You cannot say 
that since F = ma, it follows that $m = F/a$; that is circular reasoning since
you have not told me how to measure F either. 

Let me give you a hint. How do we decide how long a meter is? You 
seem to know that it is arbitrary. A meter is not deduced from anything. 
Napoleon or somebody said, “The size of my ego is one meter.” That’s a
new unit of length. Seriously, at the National Bureau of Standards there
used to be a rod made of some special alloy in a glass case, and that defined 
the meter. There are fancier definitions now, but let us stick to this simple 
one. (See http://physics.nist.gov/cuu/Units/meter.html for more details on 
definitions of units.) Then I ask you, “What is two meters and what is three 
meters?” We have ways of handling that. You take the meter and attach it to 
a duplicate, and that’s two meters. You can cut it in half, using dividers and 
compasses; you can split the meter into any fractions you like. Likewise for 
mass, we will take a chunk of some material and we will call it a kilogram. 
That is a matter of convention, just like one second is some convention. 

I’m going to give you a glass case that contains a block of some metal 
defined as one kilogram. Then I give you another object, an elephant. 
What’s the mass of the elephant? Here is a hint: I also give you a spring. 
We cannot do the seesaw experiment because it requires gravity. A spring, 
on the other hand, will exert a force even in outer space. Here’s what we 
do. We hook one end of the spring to a wall and we pull the other end from 
its equilibrium position by some amount and we attach the one-kilogram 
mass to it. We don’t know what force it exerts, but it will not matter. We let 
it go and measure the initial acceleration, $a_1$. Then we bring the elephant
(another point particle) of unknown mass $m_E$, pull the spring by the same
amount so it can exert the same force, and find the acceleration $a_E$ of the
elephant. Assuming only that the same extension produces the same force 
in the two cases, we have 


$$1 \cdot a_1 = m_E\, a_E \tag{3.3}$$
$$m_E = \frac{a_1}{a_E} \tag{3.4}$$


Once you have the mass of the elephant you can use it to measure any 
other mass $m_o$ by using


$$m_o = m_E\,\frac{a_E}{a_o} \tag{3.5}$$


where $a_E$ and $a_o$ are produced by the same force.
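
In code, the whole scheme is one line of arithmetic. The sketch below assumes the same spring extension, and hence the same force, in every trial; the acceleration readings are hypothetical numbers chosen for illustration.

```python
def mass_from_accelerations(a_standard, a_unknown, m_standard=1.0):
    """Same force on both bodies, so m_standard * a_standard = m_unknown * a_unknown."""
    return m_standard * a_standard / a_unknown

# Hypothetical readings in m/s^2, all taken at the same spring extension
a_1 = 8.0      # the 1-kg standard
a_E = 0.002    # the elephant
a_o = 0.5      # some other object

m_E = mass_from_accelerations(a_1, a_E)                   # Eqn. 3.4
m_o = mass_from_accelerations(a_E, a_o, m_standard=m_E)   # Eqn. 3.5
print(m_E, m_o)
```

Notice that the force itself never appears: only ratios of accelerations are needed, which is why we can define mass before we have defined force.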

There are subtleties even here. For example, how do we know that 
when we pull the spring the second time with the elephant, it will exert 
the same force as the first time when the 1 kg was attached to it? After all,
springs wear out. That’s why you change the shock absorbers in your car.
So first, we have to make sure the spring exerts a fixed force every time 
(for a given extension). How are we going to check that? We don’t have 
the definition of force yet. But we can do the following. We pull the one- 
kilogram mass and let it go, and we note the acceleration. Then, we pull it 
again, by the same amount, and let it go; we do it ten times. If every time 
we get the same acceleration, we are convinced this is a reliable spring that 
is producing the same force under the same conditions. On the eleventh 
time we pull the spring and attach the elephant. With some degree of con- 
fidence, we can say we are applying the same force on the elephant as on 
the one-kilogram mass. 

Why is this discussion so important? Because you need to know that 
everything you or I write down in the notebook or on the blackboard as 
a symbol is actually a measurable quantity, or, as they say in France, Les 
Mesurables. You should know at all times how you measure anything that 
enters your theory or calculation. If not, you are just doing math or playing 
with symbols. You are not doing physics. 

This discussion also tells you that the mass of an object has nothing 
to do with gravitation but with how much it hates to accelerate in response 
to a force. Newton tells you forces cause acceleration. But the acceleration 
is not the same on different objects for a given force. Certain objects resist 
it more than others. They are said to have a bigger mass. We can be precise 
about how much bigger by saying, “If the acceleration of a body in response 
to a given force is $\frac{1}{10}$ that of a 1-kilogram mass, then the mass of the body
is 10 kilograms.” 


3.3 Two halves of the second law 


We have seen how all objects can be attributed a mass. Now go back to the 
spring. I want to know how much force F(x) it exerts when I pull it by a 
certain amount x, which is measured from the point when the spring is 
neither compressed nor expanded as shown in Figure 3.1. If x is positive, 
it means the spring is stretched; if it’s negative, it means the spring is com- 
pressed. Now I can measure F(x) for any given x because I can measure the 
acceleration it produces on a known mass m and use F = ma. So I pull it by 
various amounts, measure F, and draw a graph. It will be a straight line 
with some slope $-k$ for small values of $x$:


$$F = -kx \tag{3.6}$$


Figure 3.1 (Left) The mass m is attached to a spring (dotted line) of force 
constant k. The other end of the spring is attached to a wall. The displacement x 
is measured from the equilibrium position of the spring. (Right) The force F(x) 
as a function of x. 


where k is called the force constant. The minus sign says, if you pull it to 
the right, so that x is positive, the spring will exert a force in the negative 
direction. If you compress it, then x is negative and the spring will exert 
a force in the direction of increasing x. All springs will have a graph like 
this for small enough distortions from equilibrium. Beyond that the line 
may bend or the spring may even snap. In any case, we now have a way to 
measure k for any given spring in the regime when F(x) is linear in x. 
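
One way to extract k from such measurements is a least-squares fit of a straight line through the origin. The sketch below is only an illustration: the fitting method is my choice, not something prescribed in the lecture, and the data are invented numbers for a 1-kg mass.

```python
def fit_force_constant(mass, displacements, accelerations):
    """Least-squares slope through the origin for F = -k x, with F = m a."""
    forces = [mass * a for a in accelerations]
    numerator = sum(x * f for x, f in zip(displacements, forces))
    denominator = sum(x * x for x in displacements)
    return -numerator / denominator

# Hypothetical data for a 1-kg mass: x in m, initial acceleration in m/s^2
xs = [-0.02, -0.01, 0.01, 0.02, 0.03]
accs = [1.01, 0.49, -0.50, -0.99, -1.51]
k = fit_force_constant(1.0, xs, accs)
print(k)  # close to 50 N/m for these made-up readings
```

The fit is meaningful only in the regime where the graph is a straight line; readings taken at large distortions should be left out.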

I want you to think about the two equations $F = ma$ and $F = -kx$. If
the first one is Newton’s law, then what’s the other one? What’s the difference
between saying $F = -kx$ and $F = ma$? Let me paraphrase the answer
I usually hear from students: “$F = ma$ is universally true, independent of
the nature of the force acting on a body, while $F = -kx$ is only describing
the spring.” That is essentially correct, and I will now elaborate. 

The cycle of Newtonian dynamics has two parts. 

The first one is to find the acceleration a of a body, the force on
which is somehow known, using $a = F/m$. The force F is the cause and the
acceleration a is the effect and $a = F/m$ is the precise relation between them.

The second is to deduce experimentally what force F will be acting 
on a given body at any given time. For example, if a mass is attached to 
a spring that has been extended by x, we must do the experiment to find 
that the spring force is $F = -kx$. Newton does not tell us that. He never
tells us what F is in a given context, with the exception of gravity where
he also furnished the left-hand side of F = ma with his law of universal
gravitation. If two like charges repel each other, we need Coulomb’s law to
tell us that the force varies as $1/r^2$. The nuclear force, say between a proton
and neutron, falls exponentially with distance. Surely Newton did not tell 
us this. But once we have experimentally determined a new force F, we can 
use his second law to deduce the a it would produce, assuming classical 
mechanics is applicable. 

When a new force, like the electric force, is discovered, the F due to 
it can be measured in one of two ways. One is to compare it to a known 
force that balances it. For example, to find how the repulsive force between 
two charged bodies varies with distance, we can glue them to the two ends 
of a spring of known k and measure by how much it is extended in equi- 
librium. Another way is to release the charges (of known mass) from rest 
with some separation, measure the initial acceleration of either, and then 
compute ma. 

So physicists are busy either finding a from F (as when computing 
the orbit of a satellite given the force law for gravity) or F from a (as when
stretching springs to find k or dropping apples from trees to measure the 
force of gravity). 

Now, a small digression on gravity. Consider a body near the surface 
of the earth, the force of gravity on it being $F = -mg$ where $g = 9.8\ \mathrm{m\,s^{-2}}$.
That’s something you find out by dropping things from a tower. Consider
a in the field of gravity. We find it is


$$a = -\frac{mg}{m} = -g \tag{3.7}$$


for all bodies. That’s a very remarkable property of the gravitational force— 
the cancellation of m. If you look at the electrical force, on the proton and 
electron for example, it’s not proportional to the mass of either object. It’s 
proportional to the electric charge of the object. Therefore, when you di- 
vide by the mass to get the acceleration, the response of different bodies is 
inversely proportional to the mass. But gravity has a remarkable property 
that the pull of the earth is itself proportional to the mass of the object. 
So, when you divide by the same m to find a, it cancels and everything 
falls at the same rate on the surface of the earth. In fact, that is a property 
of gravitational fields anywhere, even in outer space. Everything (gold,
silver, diamonds, particles, elephants) accelerates the same way. This
remarkable fact was known for a long time, but it literally took an Einstein
to figure out why nature behaves in that fashion, why the two masses asso- 
ciated with a body are equal. One is the inertial mass, which is how much 
a body hates velocity change, how hard it resists acceleration, the mass in 
F = ma. That quality can be measured far from planets, far from every- 
thing. The other is gravitational mass, which is the measure of how much it 
is attracted to the earth or any other body. There’s no reason why these two 
attributes had to be equal. Is this just an accident or is it part of a big pic- 
ture? It turns out that it’s part of a big picture called the general theory of 
relativity. Here is Albert Einstein’s description of gravity. Imagine a stream 
and some kids dropping various leaves or paper boats in it. No matter what 
they drop (within reason) the object’s trajectory will follow the flow lines 
of the water. The path of all objects is predetermined. This is what gravity 
does to spacetime—it defines trajectories for objects: anything you release 
will follow the trajectory etched in spacetime. If you oppose this flow, like 
a kid holding on to his paper boat, the resistance you feel is the weight of 
that object. What determines the flow lines at each point? The matter and 
energy in the universe, as dictated by Einstein’s equations. 

Here’s a simple example of a complete Newtonian problem. A mass 
is attached to a spring. It is pulled by a certain amount x and then released. 
What is it going to do? Newton says F = ma, which in this case becomes 


$$m\frac{d^2x}{dt^2} = -kx. \tag{3.8}$$


To proceed, we must know m and k, and I have already discussed how these
are measured. Now we have a mathematically complete problem: find the
function x(t) whose second derivative is equal to $-\frac{k}{m}$ times the function
x(t). Then, we go to the math department and say, “What’s the solution to 
this equation?” This is a problem in mathematics and the answer—that it’s 
going to be oscillating back and forth—will come from doing the math. 
Later we will do some of that math ourselves. For now, I am simply point- 
ing out that once we have stated the laws in mathematical form, solving for 
the consequences is a mathematical problem. 
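
If you cannot wait for the math department, you can let a computer step Eqn. 3.8 forward in small time intervals. The sketch below is an illustration under stated assumptions: the stepping rule (semi-implicit Euler) is my choice, and the values of m, k, the initial pull, and the time step are made up. It compares the result with $x_0 \cos(\sqrt{k/m}\, t)$, the known solution for a mass released from rest.

```python
import math

def simulate_spring(m, k, x0, dt, steps):
    """Step m x'' = -k x forward in time, starting at rest at x0."""
    x, v = x0, 0.0
    history = [x]
    for _ in range(steps):
        a = -k * x / m   # Newton: acceleration from the current force
        v += a * dt      # update the velocity first (semi-implicit Euler)
        x += v * dt      # then the position, using the new velocity
        history.append(x)
    return history

m, k, x0 = 1.0, 4.0, 0.1   # illustrative values: kg, N/m, m
dt = 0.001                 # time step in s
xs = simulate_spring(m, k, x0, dt, steps=5000)

t_final = 5000 * dt
exact = x0 * math.cos(math.sqrt(k / m) * t_final)
print(xs[-1], exact)       # the two should agree closely
print(min(xs), max(xs))    # the mass swings between roughly -x0 and +x0
```

The computer knows nothing about cosines; it only applies a = F/m over and over, and the oscillation emerges by itself.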

Here is another example. Newton discovers a force of gravity acting 
on everything. Here’s the sun in Figure 3.2, orbited by a planet. At this 
instant, the planet may be moving at some velocity v. The acceleration of 
the planet is due to the force of gravity between the planet and the sun, 
which Newton tells you is directed toward the sun, proportional to the
product of the two masses, and which decreases with distance as $1/r^2$. This
completely specifies the left-hand side of F = ma. That’s the law of universal
gravitation. Then again, because the second derivative of the position is
connected to the position, you go to the mathematicians and say, “What
orbit is the solution to that equation?” and they will tell you it is an ellipse.


Figure 3.2 The planet is separated from the sun by r. The force F on it was
determined to be always opposite to r, i.e., pointing toward the sun and falling as
$1/r^2$. (The force is slightly offset from r for clarity.)

Of course, Newton did not have mathematicians he could go to. He 
was the math guy. Not only did he formulate laws of gravitation, he also 
invented calculus and figured out how to solve the differential equation 
that came out of his F = ma. There has been no one like him. Here I speak 
of Newton the scientist; Newton the man had many flaws. 


3.4 Newton’s third law 


The third law says that if there are two bodies, called 1 and 2, then $F_{12}$, the
force on 1 due to 2, is minus the force on 2 due to 1:


$$F_{12} = -F_{21}. \tag{3.9}$$


Action and reaction are equal and opposite. Coulomb’s law and the law of 
gravity both have this feature. You may assume it for every force in this 
course. 

We are now going to put the laws to work. You have to be good at 
writing down the forces acting on a body. That’s what all these problems 
are going to boil down to. Do not forget the existing forces and do not make 
up your own forces. I have seen both happen. Every force, with the excep- 
tion of gravity, is a force due to direct contact with the body: a rope is 
pulling it, a rod is pushing it, you are pushing it, you are pulling it, one 
block is pushing another, and so on. Gravity is one force that acts on a
body without the source of the force actually touching it. (Later electromagnetic
forces can come in, but not in this book.) That’s it. Once more
with feeling: With the exception of gravity all forces we will discuss in this 
book will be contact forces. 

We are going to begin with simple problems in mechanics. They will 
get progressively more difficult. Let’s start with our first triumph. There 
is some object of mass 5 kilograms and I apply 10 N on it. What’s the 
acceleration? Everyone knows it is $a = \frac{10}{5} = 2\ \mathrm{m\,s^{-2}}$. You may have done this
before, but I hope now you understand how we know the force is 10 N and 
how we know the mass is 5 kg. The algebra is, of course, very trivial here. 

Next, I have a 3-kg block placed against a 2-kg block, and I’m pushing
the former with 10 N as shown in Figure 3.3. I want to know what hap- 
pens. One way is just to use your common sense and realize that these two 
blocks are going to move together. You know intuitively that if they move 
together, they will behave like an object of mass 5 kg and the acceleration 
will again be $2\ \mathrm{m\,s^{-2}}$. What about gravity? What about the force due to the
table on which the masses are moving? Imagine that this occurs in outer 
space where there is no gravity and no need for a table. 

There’s another way to do this problem, which is to draw free-body 
diagrams. Here you can pick any one body that you like and apply F = ma 
to it, provided you identify all the forces acting on that body. We'll first 
pick the 3-kg mass. My 10 N is certainly acting on it. What other force is 
acting? The force of the 2 kg, which has a magnitude f acting to the left.


Figure 3.3 Top: A force of 10 N acts on the two blocks, viewed as a single entity.
Bottom: The free-body diagram for the two blocks showing all the forces on each
block. Notice the third law is being invoked.


Do not include the force exerted by the 3 kg on the 2 kg. Next consider
the 2-kg mass. There is the same f, but acting to the right by the third law. 
Here is the mistake some people make: they add to that the 10 Newtons. 
They feel that the 2 kg will surely feel it since that is what is behind all the 
acceleration. That will be a mistake. That’s an example of adding a force 
that you should not be adding. The only force acting on this little guy is 
this little f. 
Now we do F = ma for these two guys: 


$$10 - f = 3a \tag{3.10}$$
$$f = 2a \tag{3.11}$$
$$10 = 5a \quad \text{upon adding the previous two equations} \tag{3.12}$$
$$a = 2. \tag{3.13}$$


Notice I’m using the same acceleration for both. I know that if the
second mass accelerated faster than the first one, then the picture is com- 
pletely wrong; it will not feel the force due to the first. If it accelerated less 
than the first, the first would have plowed into the second. Since that also 
cannot happen, they're moving with the same acceleration. There’s only 
one unknown a. Once you got a = 2, you can go back to Eqn. 3.11 and 
obtain f = 4 N. Now we know the full story: 4 N acting on 2 kg gives it an 
acceleration of $2\ \mathrm{m\,s^{-2}}$, while (10 − 4) = 6 N acting on the 3 kg gives the
same $a = 2\ \mathrm{m\,s^{-2}}$.
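
The same bookkeeping works for any force and any pair of masses. Here is a minimal sketch; the function name `pushed_blocks` and its argument names are mine, and it simply solves Eqns. 3.10 and 3.11 with the numbers replaced by symbols.

```python
def pushed_blocks(force, m_pushed, m_front):
    """Solve force - f = m_pushed * a and f = m_front * a for a and f."""
    a = force / (m_pushed + m_front)  # from adding the two equations
    f = m_front * a                   # contact force, from the second equation
    return a, f

a, f = pushed_blocks(10.0, 3.0, 2.0)
print(a, f)  # 2.0 4.0
```

The contact force is always smaller than the applied force, since it only has to accelerate the block in front.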

Here’s another variation shown in Figure 3.4. I have a 3-kg mass at- 
tached by a rope to a 2-kg mass, which I pull with a force of 10 N. Again, 
your common sense tells you that I am pulling something whose effective
mass is 5 kg; the answer is $2\ \mathrm{m\,s^{-2}}$. Let’s confirm that systematically by
using the free-body diagrams in the lower half of the figure. Now, there 
are really three bodies here: the two blocks and the rope connecting them. 
In all these examples, I assume the rope is massless. We know there is no 
such thing as a massless rope, so what we mean is a rope whose mass is 
negligible compared to the two blocks being pulled. We’ll take the ideal- 
ized limit where the mass of the rope is 0. The 3 kg is being pulled by 
the rope to the right with a force that I’m going to call T, which stands
for tension. The rope is being pulled backward by the 3 kg with a force 
T by the third law. What is the force on the other end of the rope? What 
should that be? If you said T, that is right, but not because the rope would
snap otherwise. Something else will be a problem. If the two forces on its
ends don't cancel, you have a net force. What are you going to divide by
to get the acceleration? Zero, right? So, a massless body cannot have a net
force on it, because its acceleration would then be infinite. Massless bodies
will always have equal and opposite forces on the two ends. In the case of
the rope, this is called the tension on the rope. The tension is not 0 just
because this T and that −T cancel. Suppose you are being pulled by my
favorite animals, the elephants, from both sides by equal and opposite
forces. You won't find any consolation in the fact that these forces add up
to 0 as you get subdivided.


Figure 3.4 (Top) A force of 10 N pulls the two blocks connected by a massless
rope, all treated as a single entity. (Bottom) The free-body diagram for the two
blocks and the rope, which experiences equal and opposite forces of magnitude T
called the tension.

Now, you can invoke F = ma for the three objects, starting from the left:


$$T = 3a \tag{3.14}$$
$$T - T = 0 \cdot a \tag{3.15}$$
$$10 - T = 2a. \tag{3.16}$$


We add the three equations and we get 10 = 5a or $a = 2\ \mathrm{m\,s^{-2}}$ and T = 6 N.
So, the tension on the rope is 6 Newtons. This is very important: when you 
buy a rope, the specifications will tell you how much tension the rope can 
take before it will snap; if your plan is to accelerate a 3-kg mass with an 
acceleration of $2\ \mathrm{m\,s^{-2}}$, you had better have a rope that can take the tension
of 6 N.

If you see the rope in isolation you find its acceleration is indeterminate
according to Eqn. 3.15. This is correct; it is the non-zero masses that
determine a for everybody, and the rope goes along with this a for free. 
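
To see what the massless idealization buys us, here is a small sketch that gives the rope a mass and then lets it shrink to zero. The rope mass `m_rope` is a hypothetical extension of the problem, not something treated in the lecture, and the function and variable names are mine.

```python
def pulled_blocks(force, m_rear, m_front, m_rope=0.0):
    """Acceleration and the tensions at the two ends of the rope joining the blocks."""
    a = force / (m_rear + m_rope + m_front)  # all three bodies share one a
    t_rear = m_rear * a                      # rope pulling the rear block, as in Eqn. 3.14
    t_front = force - m_front * a            # from the front block, as in Eqn. 3.16
    return a, t_rear, t_front

for m_rope in (0.5, 0.05, 0.0):
    print(m_rope, pulled_blocks(10.0, 3.0, 2.0, m_rope))
```

With a massive rope the two ends feel different tensions, the difference being just what is needed to accelerate the rope itself. As the rope mass goes to zero the two tensions merge into the single T = 6 N found above.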


3.5 Weight and weightlessness 


Now let’s see what happens when you ride an elevator. In Figure 3.5 you are 
the stick figure standing on the weighing scale on the floor of an elevator 
that has a positive (upward) acceleration a. What will the scale register? 
We will draw free-body diagrams as in the right half of the figure for you 
and the massless spring. The spring is being pushed down by you with a 
force W and up by the floor with the same force since it is massless. It will 
compress by an x such that W = kx, and x will somehow be displayed by a 
needle or digital readout. 

(We should note a subtle thing that you may not have realized. Every 
[massless] spring is pushed or pulled by equal and opposite forces $\pm F$ at
the two ends since otherwise it would have $a = F/0 = \infty$. So which of
these two appears in $F = -kx$? Recall the mass-spring system. One end
of the spring is anchored to the wall. When we pull the other end by $+x$,
we apply a force $F = +kx$ to the right. The wall exerts a force $F = -kx$ at
the other end. In response to the $+kx$ we apply, the spring exerts a force
$F = -kx$ on us or on the mass attached to it, and that is the F we use in
writing down F = ma for the mass.)


Figure 3.5 (Left) An elevator is accelerating at a rate a and you are standing on
a scale whose spring is shown by a dotted line. (Right) The free-body diagram for
you and the spring. The spring is compressed at each end by a force W.

The equation describing you in the elevator is 


$$W - mg = ma, \quad \text{which means} \tag{3.17}$$
$$W = m(g + a). \tag{3.18}$$


If the elevator is standing still and a = 0 we see that W = mg is just 
your weight. If the elevator is accelerating upward W = m(g + a), and the 
needle will show a number greater than your weight. You will actually feel 
heavier. The spring not only has to support you from falling through the 
floor but also has to accelerate you counter to what gravity wants to do. 
That’s why we have the g + a. Say you picked up some speed and are now 
coasting upward at a steady speed. Then, a = 0 again and W = mg. As you 
come to the top of the building, the elevator has to decelerate so that it can 
lose its positive velocity and come to rest. So a will be negative now. If a is 
negative, then g + a < g. Let us write in this case a = −|a|, so that


W = m(g − |a|), (3.19)


which makes it explicit that W < mg. You will feel that your weight is 
reduced. 

On the way down your initial acceleration is negative because you're 
picking up speed toward the ground. You will feel less heavy. You can see 
that if |a| = g, your downward acceleration is that due to free fall under 
gravity and you will feel weightless. This can happen when the cable has 
snapped. You don’t feel any weight because your normal weight is the op- 
position the floor offers to keep you from falling through it; but now the 
floor is also falling at the same rate. It’s wrong to think that when you 
feel weightless you have escaped the pull of gravity. We all know that in a 
falling elevator you definitely do not escape the pull of gravity. It’s going to 
catch up with you in a few seconds. The same goes for the people floating 
around in space stations. They have not escaped the pull of gravity either; 
they have just stopped fighting it, and they are all accelerating toward the 
earth at a rate v²/r = g*, where g* is the (reduced) acceleration due to gravity
at the radius r of the orbit. If they had really escaped the pull of gravity,
their spaceship would be off to the far reaches of the universe. 
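
The elevator results, W = m(g + a) and its special cases, are easy to try out with numbers. What follows is only a minimal sketch and not part of the lecture: the function name apparent_weight, the 70 kg rider, and the sample accelerations are assumptions chosen for illustration.

```python
def apparent_weight(m, a, g=9.8):
    """Scale reading W in newtons for a rider of mass m (kg) in an
    elevator whose upward acceleration is a (m/s^2): W = m(g + a)."""
    return m * (g + a)


if __name__ == "__main__":
    m = 70.0  # hypothetical rider mass in kg
    cases = [
        ("at rest or coasting", 0.0),
        ("accelerating upward", 2.0),
        ("slowing near the top", -2.0),
        ("cable snapped (free fall)", -9.8),
    ]
    for label, a in cases:
        print(f"{label:26s} a = {a:5.1f} m/s^2   W = {apparent_weight(m, a):6.1f} N")
```

The last case has a = −g, so the reading drops to zero even though gravity is still pulling with the full mg.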


CHAPTER 4 


Newton’s Laws II 


4.1 A solved example 


The goal of physics is to be able to predict something about the future, 
given something about the present. I’m going to provide a very simple ex- 
ample that illustrates how Newton’s laws are to be used for this purpose. 
This treatment will be brief since I will return to this problem in greater 
detail later. 

Figure 4.1 shows a frictionless table, on which is a mass m attached to 
a spring of force constant k. The other end of the spring is attached to an 
immobile wall. The dotted outline of the mass shows it when it is displaced 
by an amount x from equilibrium. I want to pull the mass by some amount 
A and let it go. So that’s the knowledge of the present. What’s this guy 
going to do? That’s the typical physics problem. It can get more and more
complex. You can replace the mass by a planet; you can replace the spring
by the sun, which is attracting the planet; you can bring in many planets;
you can make it more and more complicated. But they all boil down to a
similar exercise: I have some information now and I want to be able to say
what will happen next.

Figure 4.1 A mass m attached to a spring of force constant k in its equilibrium
position, when the spring is neither extended nor compressed. The dotted outline
of the mass shows it when it is displaced by an amount x from equilibrium.

Combining F = ma and F = −kx we obtain the equation that seals
the fate of the mass:


$\frac{d^2 x}{dt^2} = -\frac{k}{m}\,x(t).$ (4.1)


This is a differential equation: you have to find a function x(t) whose
second derivative is −k/m times itself. A differential equation tells you
something about an unknown function x(t) in terms of its derivatives, and you
are supposed to find it given this information. Instead of running to the 
mathematicians, let us solve this equation by guessing, which is a totally 
legitimate way to solve a differential equation. 

Here is how we guess the answer. Let’s make our life simple by taking 
a case where k = 1, m = 1. Later on, we can put back any k and m. I’m
looking for a function whose second derivative is minus that function. Now, as
a word problem, it rings a bell, right? Do you know such a function?
Exponential is good. Trigonometric functions are good too, provided you mean
sin and cos. I’m going to dismiss the exponential, which is actually a very
good guess. If you took x(t) = e^t, we have


$\frac{d^2 x}{dt^2} = e^{t} \neq -x(t).$ (4.2)


It does not help to consider x(t) = e^(−t) since the minus sign will get squared
when you take two derivatives, and you will still end up with +x(t). Now
e^(±it), where i = √(−1), will work, but we do not want to deal with complex
numbers yet.

We just want a function that reproduces itself, up to a minus sign, when we
take two derivatives. So, I make a guess,


x(t) = cos t, (4.3)


and you can check that it works:


$\frac{d^2 x}{dt^2} = \frac{d^2 \cos t}{dt^2} = -\cos t = -x(t).$ (4.4)

Did you follow the way this very elementary problem is solved? It’s solved 
by making a guess. But something is not quite okay with this solution. If 
I put t = 0, I get x = 1. Why should it be true that I initially pulled it by 
exactly one meter? I could have pulled it by 2 or 3 or 9 meters. I want to 
be able to decide how much I pulled it by at t = 0. Suppose it was pulled 
to 5 m and released. I want x(0) = 5. This can be arranged by making a 
choice 


x(t) = 5 cos t. (4.5)


Does the 5 screw up everything? It doesn’t, because it just comes along for 
the ride: 


$\frac{d^2 x}{dt^2} = \frac{d^2 (5\cos t)}{dt^2} = -5\cos t = -x(t).$ (4.6)

Now I have an answer that does everything I want it to do. At the initial
time, it gives me 5 times cos 0, which is 5; and that’s what I said was the
initial displacement. I find dx/dt = −5 sin t, which vanishes at t = 0. That too
is correct; I pulled it and let it go. So, the instant I released it, it had no 
velocity. It satisfies Newton’s laws, and that’s the answer. I want to do this 
simple example in totality because this is the paradigm. This is the example 
after which everything else is modeled. 

There are two final points about this solution. 

First, there is the other option x(t) = sin(t) that occurs to some stu- 
dents. There is nothing wrong with this option; it just is not needed in this 
particular example. Equation 4.5 is not the final and most general solution 
to the problem; it is a solution that certainly works for the one example I 
had, in which a mass is pulled to 5m and released. 

Second, when k and m are not both 1, we go back to the general case


$\frac{d^2 x}{dt^2} = -\frac{k}{m}\,x(t).$ (4.7)


The solution to this is


$x(t) = A\cos\left(\sqrt{\frac{k}{m}}\,t\right),$ (4.8)


where A, which was 5 in our example, is arbitrary in general and is called 
the amplitude. We will deal with the oscillator in depth later. At this point, 
we just want to get a feeling for how F = ma is applied in real life, with 
some help from the mathematicians. 
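
If guessing feels too much like magic, the guess can be compared with a brute-force computation that applies F = ma in tiny time steps. The sketch below is an illustration under stated assumptions, not the method used in the lecture: the velocity-Verlet stepping scheme, the function names, the step size, and the values of k, m, and A are all choices made for this example.

```python
import math


def simulate_spring(k, m, A, t_end, dt=1e-4):
    """Step m x'' = -k x forward from x(0) = A, v(0) = 0 with the
    velocity-Verlet scheme and return the position at time t_end."""
    x, v = A, 0.0
    a = -k / m * x
    steps = int(round(t_end / dt))
    for _ in range(steps):
        x += v * dt + 0.5 * a * dt * dt   # move using current v and a
        a_new = -k / m * x                # spring force at the new position
        v += 0.5 * (a + a_new) * dt       # average old and new acceleration
        a = a_new
    return x


def guessed_solution(k, m, A, t):
    """The guessed answer: x(t) = A cos(sqrt(k/m) t)."""
    return A * math.cos(math.sqrt(k / m) * t)


if __name__ == "__main__":
    k, m, A = 3.0, 2.0, 5.0  # arbitrary illustrative values
    for t in (0.5, 1.0, 2.0, 5.0):
        print(t, simulate_spring(k, m, A, t), guessed_solution(k, m, A, t))
```

The two columns should agree closely, and the agreement should improve as dt is made smaller.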


4.2 Never the whole story 


Is this the whole story of the mass and spring, or is something missing? 
Does the mass oscillate forever as predicted by Eqn. 4.1, which we just solved?
It does not: it eventually comes to rest. If you never knew about friction, 
and you solved for x(t) with just the spring force, you would find it didn’t 
work. So, you are at a fork in the road. You can say either that Newton’s 
laws are incorrect or that you are missing some forces. In the latter case, 
you will look around and eventually deduce the frictional force by insisting 
that F = ma works with its inclusion. We will see how this is done later in 
this chapter. 

It turns out that even if there were no friction, Eqn. 4.1 would not 
be the correct law, which is given by Einstein’s relativistic dynamics. By 
that I mean, if you really pull a real spring by 5 m on a frictionless
surface and release it, its motion will not be exactly 5 cos t; it will be off
by a very, very, very tiny amount that you probably will not discover in 
most laboratories. But if the mass begins to move at a velocity that is 
comparable to that of light, then this equation will make the wrong pre- 
dictions. On hearing that even Newton can be wrong in this sense, some 
people say, “What kind of business are you in? Every once in a while 
some authority is proven wrong.” I’ll tell you right now. Everything we
know is wrong at some level. Newton didn’t try to describe things moving
at speeds comparable to light. He dealt with the problems he could
deal with at that time. So his laws have a limited domain of validity. 
You can always push the frontiers of observation until you come to a 
situation where any given law breaks down. While the special theory of
relativity does better than Newtonian mechanics for large velocities, it
too fails if the mass becomes very tiny, of atomic dimensions. Then you
need the laws of quantum mechanics discovered by Heisenberg, Dirac,
and Schrödinger, and of course their laws also have problems in some
experimental domain. Sometimes we correctly abandon the formalism; but
we should not be too eager to do that. In the present case, the problem 
is not with formalism but with not including friction. The failure at the 
quantum level comes not because we did not include some forces, but be- 
cause the very notions of force or trajectories x(t) seem to be invalid at that 
scale. 


4.3 Motion in d=2 


We're going to move on to higher dimensions, where position, velocity, 
acceleration, and force are all vectors. So let’s again start with simple prob- 
lems in d = 2 and make them more and more complex. There is no limit 
to how difficult mechanics problems can be. If you go back and read some 
Cambridge University exams from 1700 or 1800, you will find some really 
difficult problems. Finally, quantum mechanics was invented and life got a 
lot easier. 

Consider a mass m sitting on a table as shown in the left half of Figure 
4.2. Because we're in two dimensions, we need to have two axes x and y. 

Figure 4.2 A mass m sitting on a table, (left) without and (right) with friction.
The coefficient μ = μ_s if the block is at rest and μ = μ_k if it is moving.

Recall that


$\mathbf{F} = m\mathbf{a}$ (4.9)


is a vector equation. If two vectors are equal, then their x and y
components have to match:


$F_x = m a_x$ (4.10)
$F_y = m a_y.$ (4.11)


I’m going to apply them to this block. There are no known forces acting in 
the x direction, and the block is not moving in that direction. Therefore 
it’s a case of 0 = 0. Now I look in the y direction. In the free-body diagram 
there is the force of gravity mg acting down the y-axis; I will often denote
this vector by $m\mathbf{g}$, where $\mathbf{g} = -9.8\,\mathbf{j}\ \mathrm{m\,s^{-2}}$. If that’s all you had, the block
would fall through the table. Because the block is not falling, we know 
the table is exerting an opposing force denoted by N, where N stands for 
normal. And normal is a mathematical term for perpendicular. Evidently 
N is a positive force and mg is a negative force in the y direction. Thus 


$N - mg = m a_y.$ (4.12)


In this application of the Newtonian equation, I know the right-hand 
side. It is 0 because I know this block is neither sinking into the table nor 
flying off the table. It’s sitting on the table; it has no velocity or acceleration 
in the y-direction. I come to the conclusion 


N = mg. (4.13)


4.4 Friction: static and kinetic 


Now I’m ready to introduce another force, the force of friction, f. How do 
we infer there is a force of friction? Consider the mass on the table in the 
right half of Figure 4.2. What experiment will tell you there is another force 
called friction? You find that to keep the mass moving at constant velocity, 
you have to apply a force. That means the force you’re applying is canceled
by something else, because there is zero acceleration. Here is another good
answer: You give the mass a push and soon it stops moving; there must
have been a force that produced the deceleration. But even before that,
even before it starts moving, you find that, if you push it, it doesn’t move.
Say I push the podium. If I push it gently, it does not move. But if I push
hard enough it begins to move. What’s happening before it moves? I’m
applying a force and I’m getting nothing in return for it. So I know there is
another force exactly balancing mine, and that’s the force of static friction.
Let us call the force I am applying F_me. I’m applying it to the right, and so
there has to be another force of equal magnitude to the left. Notice it is not 
a fixed force: it’s whatever it takes to keep the mass from moving. It will 
not be less than what I apply, because then the mass will move; it cannot 
be more than what I apply because then it will start moving me backward. 
So, static friction is a force that has a range from 0 to some maximum. The 
maximum turns out to be 


$f = \mu_s N$ (4.14)


where μ_s is a number called the coefficient of static friction and N is the
normal force. In our example you may use N = mg. So the force of friction 
seems to depend on how heavy the object is that’s sitting on the table. But 
it doesn’t depend on the area of contact. For example, it does not increase 
if we rest the block on another face with greater area. You might expect 
more friction because there’s more contact. But that is not so, for reasons 
not readily explained within our elementary treatment. In reality, friction 
has a subtle origin at the atomic level. 

If static friction provides a force equal to the force that I apply, up
to a maximum of μ_s N, what happens when I exceed the maximum? The
object I am pushing will start moving. Once it starts moving, the frictional 
force, which is always directed opposite to the velocity, changes to 


$f_k = \mu_k N$ (4.15)


where μ_k is the coefficient of kinetic friction. We find that in all
situations μ_k < μ_s. Let us say μ_s = 0.25 and μ_k = 0.2 for the block on
the table. If I apply a force that is 0.1 times its weight, it won’t move;
at 0.2 times its weight it won’t move; at a quarter of its weight it’s 
a tie; at 0.26 of its weight the block will start moving. Once it starts 
moving, the frictional force will drop to 0.2 times its weight, and the 
body will accelerate if I maintain the same force as when it first begins 
moving. 
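
The rule "static friction matches the push up to a maximum, then kinetic friction takes over" can be written as a few lines of logic. This is a minimal sketch, assuming a level table so that N equals the weight; the function name friction_force and its arguments are hypothetical, introduced only for this illustration.

```python
def friction_force(applied, normal, mu_s, mu_k, moving=False):
    """Return (friction_magnitude, block_moves) for a horizontal push.

    applied: magnitude of the horizontal push
    normal:  normal force N (equal to mg on a level table)
    moving:  True if the block is already sliding
    """
    if moving or applied > mu_s * normal:
        # Sliding: friction takes the fixed kinetic value mu_k * N.
        return mu_k * normal, True
    # At rest: static friction is whatever it takes to cancel the push.
    return applied, False


if __name__ == "__main__":
    weight = 1.0  # measure forces in units of the block's weight
    for push in (0.1, 0.2, 0.25, 0.26):
        f, moves = friction_force(push, weight, mu_s=0.25, mu_k=0.2)
        print(f"push = {push:.2f}  friction = {f:.2f}  moves = {moves}")
```

With the numbers used in the text, the push of 0.26 times the weight sets the block moving, friction drops to 0.2 times the weight, and the leftover 0.06 times the weight accelerates the block.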


4.5 Inclined plane 


Now we are going to do the one problem that has sent more people away 
from physics than anything else. It is called the inclined plane. A lot of
people who do not remember where they were during the Kennedy
assassination say, “I remember the day I saw the inclined plane; that’s the day I
decided I’m not going into physics.” This is very bad publicity for our field. 
You come into a subject hearing about relativity and quantum mechanics, 
and we hit you with this. So why am I still doing this? Because this is the 
entry ticket into the business. It is inconceivable to me that you could un- 
derstand more advanced topics without being able to understand this one. 
Go ahead and mock the inclined plane, but only after you can prove you 
have mastered it. 

Here is the notorious problem. There is a mass m sitting on a plane, 
inclined at an angle θ as shown in the left half of Figure 4.3. We know
it’s going to slide down the hill, but we want to be more precise. The 
whole purpose of Newton’s laws is to quantify things for which you al- 
ready have an intuition. The novel thing about the inclined plane is that 
for the first time we are going to pick our x and y axes not along the 
usual directions, but parallel and perpendicular to the incline. What are 
the forces on this mass? I have told you, first deal with contact forces. But 
the only thing in contact with the mass is the plane. In general the
inclined plane can exert a force along and perpendicular to its own surface,
but I’m first going to take a case where there is no friction. A frictionless
plane by definition cannot exert a force along its own length, so it can only exert
a normal force N. Then there’s only one other force, the force of gravity 
that we agreed we have to remember. Even though the earth is not touch- 
ing this block, it is able to reach out from down below and pull this block 
down. These are the two forces, and the mass will do what these forces tell 
it to do. 


Figure 4.3 A mass m sitting on an inclined plane, without friction (left) and 
with friction (right).

First I have to take this mg pointing down and break it up into
components along my x and y axes. That’s called resolving the force into various
directions. Now, the key to this is to know that this angle θ of the plane is
the same as the angle between vertical and the normal to the plane. You
have to agree that if I draw a line perpendicular to the horizontal and
I draw a line perpendicular to the incline, the angle between those two
perpendiculars will be the same as the angle between the two lines with
which I began. This is because “make perpendicular” means “rotate by π/2.”
If you rotate both lines by π/2, the angle between the rotated lines will be
the same. The vertical is perpendicular to the horizontal, and the normal
is perpendicular to the inclined plane. Once you understand that, the rest
is easy.

So here are my equations: 


$mg\sin\theta = m a_x$ (4.16)
$N - mg\cos\theta = m a_y.$ (4.17)
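
The components mg sin θ and mg cos θ need not be taken on faith: they can be checked by writing the tilted axes in the ordinary horizontal and vertical frame and taking dot products with the weight. The sketch below assumes a slope that descends to the right; the function name gravity_components and the unit vectors x_hat and y_hat are illustrative choices, not notation from the text.

```python
import math


def gravity_components(m, theta, g=9.8):
    """Resolve the weight along axes tied to the incline.

    x points down the slope, y points out of the plane along the normal.
    Returns (F_x, F_y) as dot products with unit vectors expressed in the
    usual (horizontal, vertical) frame.
    """
    weight = (0.0, -m * g)                        # straight down
    x_hat = (math.cos(theta), -math.sin(theta))   # down the slope
    y_hat = (math.sin(theta), math.cos(theta))    # normal to the plane
    f_x = weight[0] * x_hat[0] + weight[1] * x_hat[1]
    f_y = weight[0] * y_hat[0] + weight[1] * y_hat[1]
    return f_x, f_y


if __name__ == "__main__":
    m, theta = 2.0, math.radians(37.0)  # illustrative values
    f_x, f_y = gravity_components(m, theta)
    print("along the slope:", f_x, "expected", m * 9.8 * math.sin(theta))
    print("along the normal:", f_y, "expected", -m * 9.8 * math.cos(theta))
```

The negative sign of the second component says that this part of gravity pushes into the plane, which is what N has to cancel.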


Now we know that this block is sliding down the incline. It’s not going 
into the plane, and it is not flying out of the plane. That’s the reason we 
measure our y coordinate perpendicular to the plane: it does not change, 
unlike the traditional x and y, both of which do. So a_y = 0 in Eqn. 4.17,
which implies


$N = mg\cos\theta.$ (4.18)

Then you come to Eqn. 4.16. We cancel m and read off the answer

$a_x = g\sin\theta.$ (4.19)


That’s the big result, that the mass will slide down the hill with an
acceleration g sin θ. This provides a good way to measure g, because if you drop
something vertically, it falls too fast for you to time it. But if you let it go
down an incline, by making θ very small, you can reduce the acceleration
by a factor sin θ.

Here’s another thing I should tell you right now. The professionals 
don’t put in numbers until the very end. So, if you are told the incline is 
at an angle of 37 degrees, and g = 9.8 m/s², don’t start putting numbers
into the first equation. I know for some of you it’s traumatic to work with
symbols. There are many reasons to work with symbols rather than with
numbers till the end. First of all, if you have already put the numbers in and
I suddenly tell you, “Hey, I was wrong about the slope, it’s really 39 degrees
and not 37,” you are forced to do the whole calculation again. But if you
use symbols to derive the formula for a, and then ask me, “What’s your
θ?” you can just plug that in. Likewise, if I change the value of g because
someone made a better measurement, you simply change g in the very last
expression for a_x. You can also see if your answer had some mistakes in it.
Suppose you got a_x = g² sin θ. You will know it’s wrong because the units
do not match. Maybe you have the trigonometry wrong. Maybe it’s really
a_x = g cos θ? You can do a few tests. For example, as the plane becomes
less and less inclined, you have to get less and less acceleration, and when
θ → 0, so must a_x, which must then be proportional to sin θ. Or, if you
make the incline almost vertical, the block is just falling under gravity with
a_x = g. The result a_x = g cos θ does not do that when we set θ = π/2. Finally,
a_x = g sin θ is independent of m. That’s an interesting property of the result
you would not notice if you kept numbers everywhere. So we will agree to 
work with symbols until the end. 
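
Keeping symbols until the end is also how one would write this on a computer: the formula becomes a function, and the numbers are supplied only when it is called. The sketch below is an illustration with assumed names (incline_acceleration, g_from_timing). The second function assumes the block starts from rest and uses the constant-acceleration result L = (1/2) a t² to turn a timing measurement into a value of g.

```python
import math


def incline_acceleration(theta, g=9.8):
    """Acceleration down a frictionless incline: a_x = g sin(theta)."""
    return g * math.sin(theta)


def g_from_timing(length, time, theta):
    """Infer g from the time taken to slide a distance `length` from rest,
    assuming L = (1/2) a t^2 with a = g sin(theta)."""
    return 2.0 * length / (time ** 2 * math.sin(theta))


if __name__ == "__main__":
    # Changing the slope from 37 to 39 degrees costs one new function call.
    for degrees in (37.0, 39.0):
        print(degrees, incline_acceleration(math.radians(degrees)))

    # Limiting cases used as sanity checks in the text.
    print("flat:", incline_acceleration(0.0))
    print("vertical:", incline_acceleration(math.pi / 2))
```

The two limiting cases printed at the end are the same tests that rule out the wrong guess a_x = g cos θ.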

Now we are going to make life a little more complicated by adding 
friction, as shown in the right half of Figure 4.3. Let the block be at rest 
and imagine there are some hinges on the inclined plane that allow me to 
increase θ from 0 to π/2. I want to know when the block will begin to slide
down. Let me cut to the chase and write the equations: 


$N - mg\cos\theta = m a_y$ (4.20)
$mg\sin\theta - f = m a_x.$ (4.21)


The first equation is the same as before and tells us that since a_y = 0,


$N = mg\cos\theta.$ (4.22)


In the second equation, do not write f = μ_s N because the frictional force
is not always μ_s N; it is whatever it takes to keep the block still, up to a
maximum of μ_s N. In other words, if θ is very small, the frictional force will
in fact be a much smaller amount f = mg sin θ, which you get by setting
a_x = 0 in the right-hand side.

Let us crank up the angle to θ*, beyond which it cannot stay still. At
this angle the friction is maximum and we can assert


$mg\sin\theta^* = \mu_s N = \mu_s mg\cos\theta^*,$ (4.23)

which gives us a way to measure μ_s:

$\mu_s = \tan\theta^*.$ (4.24)


Again, the mass cancels. So, it doesn’t matter how heavy a car you parked 
on the slope, as long as μ_s was the same. But g cancels too. So, whether you
park your car on the earth or park it on another planet, the same restriction
tan θ < μ_s applies.

When we pass this limit, when tan θ > μ_s, the block will begin to
slide down. It will now have a non-zero downhill acceleration because it
is kinetic friction μ_k N that now opposes the velocity and μ_k < μ_s. The
operative equations are


$N - mg\cos\theta = m a_y$ (4.25)
$mg\sin\theta - \mu_k N = m a_x.$ (4.26)


What’s the acceleration now? I take the y equation that says as before
that N = mg cos θ since a_y = 0, and plug that into the x equation to obtain
the downhill acceleration of the block:


$mg\sin\theta - \mu_k mg\cos\theta = m a_x$ (4.27)
$g(\sin\theta - \mu_k\cos\theta) = a_x.$ (4.28)
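
The whole story of the block on a rough incline, at rest below the critical angle and sliding above it, fits in two short functions. This is a sketch under the assumption that the block starts at rest; the names critical_angle and sliding_acceleration are hypothetical, and the friction coefficients are the illustrative values 0.25 and 0.2 used earlier for the block on the table.

```python
import math


def critical_angle(mu_s):
    """Angle theta* at which the block is about to slip: tan(theta*) = mu_s."""
    return math.atan(mu_s)


def sliding_acceleration(theta, mu_s, mu_k, g=9.8):
    """Downhill acceleration of a block that starts at rest on the incline."""
    if math.tan(theta) <= mu_s:
        # Static friction can still supply f = m g sin(theta): no motion.
        return 0.0
    # Beyond theta*: kinetic friction mu_k N opposes the sliding.
    return g * (math.sin(theta) - mu_k * math.cos(theta))


if __name__ == "__main__":
    mu_s, mu_k = 0.25, 0.2
    print("critical angle in degrees:", math.degrees(critical_angle(mu_s)))
    for degrees in (5, 10, 15, 20, 30):
        a = sliding_acceleration(math.radians(degrees), mu_s, mu_k)
        print(degrees, "degrees -> a_x =", a)
```

Note that neither function takes the mass as an argument, and that g appears only in the acceleration and not in the critical angle.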


4.6 Coupled masses 


Next is a problem of two masses m and M connected by a string that goes 
over a massless pulley mounted at the top of a frictionless inclined plane, 
as shown in Figure 4.4. What can we say before doing the math? If M > m
am I assured it’ll go downhill? No, it depends on the angle θ because only
part of its weight, Mg sin θ, is helpful in going downhill while all of mg is
pulling m down. All we know for sure is that for a fixed θ, the bigger mass
M will go down the slope as M → ∞ while the smaller mass m will go
straight down as θ → 0. Let us figure out now when exactly the balance
shifts.

Figure 4.4 (Left) Two masses linked by a massless rope and massless pulley.
(Right) The free-body diagram.

Since we have two masses here, no choice of axes is going to make 
all their motions simple: m goes straight up and down while M glides 
along the slope. Let us ignore the boring N = Mg cos θ equation for the
y coordinate and focus on the motion along the plane. In the free-body 
diagram I have shown the same tension T wherever the rope is involved. 
For a straight massless rope it is clear the opposing forces at the ends must 
have the same magnitude T since there can be no net force on a massless 
object. For a rope that curves around the pulley, the explanation is more 
subtle, as I will explain momentarily. 

For M, the equation along the x-direction is 


Mg sin θ − T = Ma. (4.29)

For m the equation in the up direction is

T − mg = ma (4.30)


where I have deliberately used the same a as in Eqn. 4.29. I am not saying 
the accelerations of M and m are equal as vectors. They do not even have the
same direction: m moves up and down while M moves along the plane. By
a I mean the components of the two accelerations in the allowed directions
of motion, defined to be positive if M is moving downhill and m is moving
up. This equality reflects the inelasticity of the rope: if m moves up one
inch, M will slide down one inch along the incline. If the magnitude of
the displacement is the same, so are the magnitudes of the velocity and
acceleration. We can solve for a by adding Eqns. 4.29 and 4.30 to obtain


Mg sin θ − mg = Ma + ma, which means (4.31)
$a = g\,\frac{M\sin\theta - m}{m + M}.$ (4.32)


Does this solution make sense? Now you notice that for a to be positive and 
for M to go downhill, it’s not enough that M > m; we need M sin θ > m
because only the force Mg sin θ is pulling M downhill: the force Mg cos θ is
trying to ram it into the inclined plane, and that’s being countered by the 
normal force N. 
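
The formula for a, together with the tension that follows from T − mg = ma, can be checked against both equations of motion numerically. The sketch below is only an illustration: the function name coupled_masses and the sample masses and angles are assumptions, and the plane is taken to be frictionless as in the text.

```python
import math


def coupled_masses(M, m, theta, g=9.8):
    """Return (a, T) for mass M on a frictionless incline tied to a hanging mass m.

    a > 0 means M slides downhill while m rises.
    """
    a = g * (M * math.sin(theta) - m) / (M + m)
    T = m * (g + a)  # from T - mg = ma for the hanging mass
    return a, T


if __name__ == "__main__":
    for M, m, degrees in [(2.0, 1.0, 30.0), (2.0, 1.0, 60.0), (2.0, 1.0, 10.0)]:
        a, T = coupled_masses(M, m, math.radians(degrees))
        print(f"M = {M}, m = {m}, theta = {degrees} deg: a = {a:.3f}, T = {T:.3f}")
```

The first case has M sin θ = m, the tipping point where a vanishes; the other two show a of either sign for the same pair of masses.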

Can we use this formula when a < 0, that is, when M moves uphill? 
After all, I did all my analysis assuming it’s going down. Yes, we can. All the 
forces I drew here—N, Mg, and so on—are not going to change when you 
change the body’s direction of motion. Gravity is always going to pull down 
whether the block moves with it or against it. So once you have a formula 
for positive a, you can apply it to negative a as well. However, if friction 
is present, you cannot do that because you have to assume a particular 
direction of motion before you can assign a direction to the frictional force 
in the equations of motion. 

Now let’s consider why the forces at the two ends of the rope emerging
from the pulley have equal magnitudes T. Assume that the pulley is
massless and that the rope does not slip; that is, as the masses move and the
rope moves over the pulley, the pulley is forced to rotate without relative
slippage. In this case the part of the rope instantaneously in contact with the
pulley and the pulley itself may be viewed as one rigid body. (It is like a 
bicycle chain whose links mesh with the teeth on the wheel the pedal is 
turning.) The tensions on the rope at the two ends are clearly trying to 
rotate this body in opposite directions, as is clear in the free-body dia- 
gram in Figure 4.4. If these tendencies are not exactly balanced—that is, 
if these forces are not equal in magnitude—they would produce an infi- 
nite rotational acceleration of this massless rigid body, just like a non-zero 
force would produce infinite linear acceleration on a massless object. By 
the no-slip condition, this means an infinite linear acceleration for the two 
masses, which is impossible because the masses are non-zero and have
finite forces acting on them. So the forces at the two ends of the rope
emerging from the pulley must have equal magnitudes. (We will return 
to a proper study of rotations and the case of the massive pulley in Chapter 
10.) Note that even though the rotational effects of the two non-parallel 
forces cancel, their non-zero vector sum can produce linear acceleration of 
the pulley. The axle about which the pulley rotates provides an equal and 
opposite force to prevent this. Luckily this last force does not contribute to 
rotations around the axle. 


4.7 Circular motion, loop-the-loop 


Now we'll turn to some interesting problems in circular motion of vari- 
ous kinds. The first, shown in part A of Figure 4.5, describes a ride in an 
amusement park. You sit in these baby rockets hanging from a rope along 
with other petrified victims. The whole thing begins to spin, and the rope, 
instead of remaining vertical, starts tilting at an angle θ, which we want
to determine as a function of the tangential speed v and radius R of the 
circular orbit. We apply F = ma and start listing the forces on the baby 
rocket. Gravity provides the usual mg pointing down. The rope can only 
exert a force T along its length. Let us trade the tension along the rope for 
the sum of two equivalent forces, a vertical part, T_y = T cos θ, and a
horizontal part, T_x = T sin θ, directed toward the center of the circular orbit.
The equations are


$T\cos\theta - mg = 0$ vertical (4.33)
$T\sin\theta = ma = m\frac{v^2}{R}$ radial or horizontal. (4.34)


In the first equation, we recall that by assumption the rocket’s orbit is a 
horizontal circle and it has no net acceleration in the vertical direction. In 
the second, we recall from Section 2.6 that a body moving in a circle of 
radius R at speed v has a centripetal acceleration v²/R. So this is a case where
we know a in F = ma. Eliminating T we find


$\tan\theta = \frac{v^2}{Rg}.$ (4.35)

Figure 4.5 (Left, part A) An “amusement” ride in which you go around in a
horizontal circle of radius R in a baby rocket. The rope supporting you
necessarily makes an angle θ = tan⁻¹(v²/(Rg)) with the vertical. (Right, part B)
A car going around a circular racetrack of radius R at speed v. The road has a
banking angle that obeys tan θ = v²/(Rg). The normal force N has a horizontal
part that gives the necessary force mv²/R to bend the path into a circle. The
symbol ⊗ is used in both parts to indicate that the rocket or car is going into
the page, away from you. The convention is based on how an arrow with feathers
would appear going away from you. Likewise ⊙ indicates an arrow coming out of
the page toward you.


It is worth finding the tension on the rope since our lives may depend on 
it. It is 


$T = mg\sqrt{1 + \left(\frac{v^2}{Rg}\right)^2}$ (4.36)


using $T = \sqrt{T_x^2 + T_y^2}$.
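
The tilt angle and the tension can be computed together and then fed back into the two force equations as a check. This is a minimal sketch; the function name rocket_ride and the sample mass, speed, and radius are assumptions made for illustration.

```python
import math


def rocket_ride(m, v, R, g=9.8):
    """Return (theta, T): the tilt of the rope from the vertical and its tension
    for a rider of mass m circling at speed v on a horizontal circle of radius R."""
    ratio = v ** 2 / (R * g)
    theta = math.atan(ratio)
    T = m * g * math.sqrt(1.0 + ratio ** 2)
    return theta, T


if __name__ == "__main__":
    m, v, R = 80.0, 10.0, 8.0  # illustrative values in kg, m/s, m
    theta, T = rocket_ride(m, v, R)
    print("tilt in degrees:", math.degrees(theta))
    print("tension in newtons:", T)
    # The two components should reproduce the vertical and radial equations.
    print("T cos(theta) =", T * math.cos(theta), " mg =", m * 9.8)
    print("T sin(theta) =", T * math.sin(theta), " m v^2/R =", m * v ** 2 / R)
```

The tension always exceeds mg, since the rope has to hold you up and bend your path at the same time.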

Here is another interesting problem. You are driving on a circular 
racetrack of radius R at a speed v. Suppose the plane of the road is strictly 
horizontal, perpendicular to g. Some agency has to exert a force mv²/R on
the car to bend it into this circle. It is of course the road that does this,
thanks to the frictional force f ≤ μ_s mg directed toward the center. (I use
μ_s, the static coefficient, and not μ_k, the kinetic one, even though the car
is moving, because we are discussing the force in the radial direction and 
the car has no velocity in that direction, unless it is skidding. Note also that 
the car does not really have to travel in a circle; all we need is that at this 
instant the trajectory is part of some circle of radius R.) 

If you don’t have the requisite static friction, if μ_s mg < mv²/R, your
car will not be able to make the curve; it will fly off. But there is a clever
way in which you can still make the turn without any friction, and that
is to bank your road by an angle θ as shown in the right half of Figure
4.5. Imagine now you’re going into the paper. The frictionless road only
exerts a normal force N. Let us resolve that force into a vertical part N cos θ
and a horizontal one N sin θ directed toward the center of the circle. The
equations are 


$N\cos\theta = mg$ (4.37)
$N\sin\theta = m\frac{v^2}{R}.$ (4.38)


Eliminating N we find the banking angle to be


$\tan\theta = \frac{v^2}{Rg}.$ (4.39)

Let me elaborate. You want the car to go around the bend at a certain
speed. If you bank your road at that angle, you don’t need any friction to 
make the turn. Even though the frictionless road can only exert a normal 
force, thanks to banking, a part of the normal force points toward the cen- 
ter, providing the requisite centripetal force. Of course, when you drive on 
a real road, you do not have to travel at exactly this speed for a given R, for 
any small differences will be made up by the frictional force of the tire. It 
is just that you do not want to rely on friction for the entire radial force. 
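
The two ways of making the turn, friction on a flat road and banking on a frictionless one, each reduce to a one-line formula. The sketch below uses hypothetical function names (max_flat_speed, banking_angle) and made-up road parameters; it is an illustration of the two results above and not a design rule for real roads.

```python
import math


def max_flat_speed(mu_s, R, g=9.8):
    """Largest speed at which static friction alone can hold the turn on a
    flat road, from mu_s m g = m v^2 / R."""
    return math.sqrt(mu_s * R * g)


def banking_angle(v, R, g=9.8):
    """Banking angle at which no friction is needed: tan(theta) = v^2 / (R g)."""
    return math.atan(v ** 2 / (R * g))


if __name__ == "__main__":
    R = 50.0     # illustrative turn radius in meters
    mu_s = 0.7   # illustrative tire-road coefficient
    print("flat-road speed limit (m/s):", max_flat_speed(mu_s, R))
    for v in (10.0, 20.0, 30.0):
        print(v, "m/s needs a bank of", math.degrees(banking_angle(v, R)), "degrees")
```

Neither result depends on the mass of the car, for the same reason the mass dropped out on the inclined plane.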
Finally, the famous loop-the-loop problem, which defies common 
sense, is shown in Figure 4.6. You come down on a roller-coaster track 
from some height H, you go on a vertical circle, and for a while you 
are upside down. The eternal question is, “Why don’t you fall down?” 
We'll find we can understand this phenomenon fully with Newton’s laws. 
The forces on the coaster are mg acting down and the force of the track, 
which has to be normal to it since it is assumed to be frictionless. But
it too points down! We are doomed! Why don’t we fall? The answer is
that we do fall, that is, we accelerate downward, but this does not mean
we get any closer to the center. The two forces mentioned combine to
bend the coaster into a circular path and produce the requisite downward
centripetal acceleration:


$N + mg = m\frac{v^2}{R}.$ (4.40)

Solving for N we find

$N = m\left(\frac{v^2}{R} - g\right).$ (4.41)

Figure 4.6 The roller coaster comes down from a height H and goes into a loop
in the vertical plane. Why does it not fall down? The forces on it are mg acting
down and the track force N also acting down! It does fall, as explained in the text.
The three arrows forming a triangle in the inset show the initial velocity
v − Δv/2 just before it reaches the top, the change Δv in a small interval near the
top, and the final velocity v + Δv/2 just after it passes the top.


If N comes out positive, that is, points down in our convention, 
f Er ‘ ; Pn 
which happens if & > g, we are safe. If it comes out negative, that is, if 
y 


— 2% 4.42 
r: (4.42) 
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it means the track exerts an upward force, which is impossible, unless there 
is some other mechanism, like a T-bracket, that goes under the track and 
supports the coaster even if it is just hanging upside down. I believe such 
things exist in real roller coasters, in case they get stuck at the top or do 
not go fast enough. In our idealized coaster, without any of this backup, 
the speed v must obey 


$$ v^2 > Rg \qquad (4.43) $$


to safely make the loop. 

We will figure out the minimum height from which it must be re- 
leased to satisfy this condition when we derive the law of conservation of 
energy. 
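
Here is a minimal Python sketch of Eqns. 4.41 and 4.43, in case you want to try some numbers. The function names, the 500 kg mass, and the 10 m loop radius are illustrative assumptions and do not come from the text.

```python
import math

G = 9.8  # m/s^2, acceleration due to gravity near the earth


def track_force_at_top(m, v, R, g=G):
    """Track force N at the top of the loop, Eqn. 4.41.

    Positive N points down (toward the center), the convention used in the text.
    """
    return m * (v**2 / R - g)


def min_speed_at_top(R, g=G):
    """Borderline speed at the top, where v^2 = R g and N = 0."""
    return math.sqrt(R * g)


if __name__ == "__main__":
    m, R = 500.0, 10.0  # assumed coaster mass (kg) and loop radius (m)
    v_min = min_speed_at_top(R)
    print(f"borderline speed at the top: {v_min:.2f} m/s")
    for v in (0.8 * v_min, 1.5 * v_min):
        N = track_force_at_top(m, v, R)
        verdict = "safe" if N > 0 else "needs a backup support"
        print(f"v = {v:5.2f} m/s gives N = {N:8.1f} N ({verdict})")
```

A negative N in this sketch is the signal discussed above: the track would have to pull the coaster up, which a bare track cannot do.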

Let us be sure to understand again why accelerating down does not 
always mean gaining speed toward the earth. If you drop an apple, acceler- 
ating down means really picking up speed toward the ground, toward the 
center of the earth. It starts with zero vertical speed and picks up speed. In 
our example, the coaster is also accelerating, but the tiny change in velocity 
in a tiny time, which points radially down in Figure 4.6 near the top of the 
loop, is now added to a huge horizontal velocity pointing to the left. 

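We will now see that this implies the velocity has a constant magnitude
and changing direction. Consider the impact of adding a tiny change
$\Delta\mathbf{v}$ to a velocity $\mathbf{v}$ on the magnitude of velocity. Assume $\Delta\mathbf{v}$ can be in
any direction relative to $\mathbf{v}$. The resultant velocity has a magnitude squared
given by

$$ |\mathbf{v} + \Delta\mathbf{v}|^2 = (\mathbf{v} + \Delta\mathbf{v}) \cdot (\mathbf{v} + \Delta\mathbf{v}) \qquad (4.44) $$
$$ = \mathbf{v}\cdot\mathbf{v} + 2\mathbf{v}\cdot\Delta\mathbf{v} + \Delta\mathbf{v}\cdot\Delta\mathbf{v} \qquad (4.45) $$
$$ = v^2 + 2\mathbf{v}\cdot\Delta\mathbf{v} + |\Delta\mathbf{v}|^2 \qquad (4.46) $$
$$ \Delta(v^2) = 2\mathbf{v}\cdot\Delta\mathbf{v} + |\Delta\mathbf{v}|^2 \qquad (4.47) $$
$$ \frac{dv^2}{dt} = 2\mathbf{v}\cdot\frac{d\mathbf{v}}{dt} \qquad (4.48) $$

(To find $\frac{dv^2}{dt}$ we need to keep just the term linear in $\Delta\mathbf{v}$.) This equation
generally implies a non-zero rate of change of the magnitude of $\mathbf{v}$, unless
$\Delta\mathbf{v}$ and the acceleration $\mathbf{a}$ are perpendicular to $\mathbf{v}$, as is the case at all times in
circular motion.

Figure 4.7 You fire two bullets from a tower at increasing speeds, which land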
farther and farther away (points 1 and 2). Beyond a critical speed, the bullet 
would go into orbit. It has of course never ceased to accelerate toward the earth. 


So a little later, the total velocity vector merely gets rotated with no 
change in length and becomes tangent to the circle at a slightly different 
point. Going around in a circle is an example of constantly accelerating 
toward the center but not getting any closer. 
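
To see the difference between a perpendicular and a parallel change in velocity numerically, here is a small Python sketch of Eqn. 4.47. The velocity of 20 m/s to the left and the change of 0.01 m/s are made-up illustrative numbers.

```python
def dot(a, b):
    """Dot product of two 2D vectors given as (x, y) tuples."""
    return a[0] * b[0] + a[1] * b[1]


def change_in_speed_squared(v, dv):
    """Right-hand side of Eqn. 4.47: 2 v.dv + |dv|^2."""
    return 2 * dot(v, dv) + dot(dv, dv)


if __name__ == "__main__":
    v = (-20.0, 0.0)         # assumed: fast coaster moving left at the top
    dv_down = (0.0, -0.01)   # tiny change perpendicular to v (radially down)
    dv_along = (-0.01, 0.0)  # tiny change of the same size, but along v
    print("perpendicular change:", change_in_speed_squared(v, dv_down))
    print("parallel change:     ", change_in_speed_squared(v, dv_along))
```

The perpendicular case leaves only the term quadratic in the tiny change, which drops out in the limit, while the parallel case keeps the term linear in it.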

Here is another example of this phenomenon. Suppose you are on a 
tower and you fire a gun horizontally as shown in Figure 4.7. The bullet 
hits the ground at point 1, under the pull of gravity. If you fire another 
bullet at a greater velocity it will land a little further away, at point 2. While 
greater initial speed will extend the time of flight even on a flat earth, the 
flight is further enhanced by the earth curving under the bullet. There will 
be a certain speed at which the bullet will keep falling but will not get any 
closer to the center, because the earth is falling under it just as fast. It is in 
orbit, as shown by the circle. That is in fact how you launch a bullet into 
orbit. So, what’s the first thing you should do when you fire this gun? Move 
away, because it’s going to come back in about 84 minutes and get you 
from behind at 17,650 miles per hour. This calculation assumes that there 
is no atmosphere, which of course leaves you with an even more pressing 
problem. 
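
The numbers quoted above can be checked with a short Python sketch. It assumes the orbit hugs the surface, so the centripetal acceleration $v^2/R$ must equal g, and it uses an approximate earth radius of $6.37 \times 10^6$ m; both the radius and the function names are assumptions made here for illustration.

```python
import math

G_SURFACE = 9.8                  # m/s^2, g near the earth's surface
R_EARTH = 6.37e6                 # m, approximate radius of the earth (assumed)
MPS_TO_MPH = 3600.0 / 1609.344   # metres per second to miles per hour


def orbital_speed(R=R_EARTH, g=G_SURFACE):
    """Speed of a ground-hugging circular orbit, from v^2 / R = g."""
    return math.sqrt(g * R)


def orbital_period(R=R_EARTH, g=G_SURFACE):
    """Time for one trip around the circle: circumference divided by speed."""
    return 2 * math.pi * R / orbital_speed(R, g)


if __name__ == "__main__":
    v = orbital_speed()
    print(f"speed:  {v:.0f} m/s, about {v * MPS_TO_MPH:.0f} miles per hour")
    print(f"period: about {orbital_period() / 60:.0f} minutes")
```

With these inputs the period comes out near 84 minutes and the speed near 17,700 miles per hour, in line with the figures in the text.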


CHAPTER 5 


Law of Conservation of Energy 


5.1 Introduction to energy 


The law of conservation of energy is a robust and powerful one. When 
the laws of quantum mechanics were discovered in the subatomic world, 
many cherished notions were abandoned. You must have heard the ugly 
rumors: particles do not have a definite position and definite velocity at 
a given time. They don’t move along continuous trajectories. You might 
think the particle must have had an interpolating trajectory connecting 
two sightings, but it does not, and to assume it does causes conflict with 
experiment. While many of the ideas of Newtonian mechanics were aban- 
doned, the notion of a conserved energy survived the quantum revolution. 
There was a period when people were studying nuclear reactions, and the 
energy they began with didn’t seem to be the energy they ended up with. 
Niels Bohr, the father of the atom, suggested that maybe the law of conser- 
vation of energy was not valid in quantum theory. Then in 1931 Wolfgang 
Pauli decided to put his money on the law of conservation of energy; he 
postulated that some other tiny electrically neutral particle, which escaped 
detection, was carrying away the missing energy. That was a radical po- 
sition to take in those days, when people did not lightly postulate new 
particles, as compared to today when if you don’t postulate a few new par- 
ticles you don’t get your PhD in particle physics. Pauli’s particle, called a 
neutrino, was detected after many, many years in 1956 by Clyde Cowan
and Frederick Reines. Nowadays, neutrinos are one of the most exciting, 
elusive, and mysterious things one could study, and they hold the key to 
many puzzles concerning the universe.

Let’s see how the notion of energy conservation arises, starting with
one dimension. When a force acts on a body, it changes its velocity. In one
dimension, this simply means speeding up or slowing down. We’re going
to find the relation between the speed accumulated when a force acts on a
body for some time and the distance the body has traveled in that time.

Consider the case when the force F is constant, not varying with time.
That produces an acceleration $a = F/m$. In the first chapter we learned that
if a body has constant acceleration a,

$$ v^2 = v_0^2 + 2a(x - x_0) \qquad (5.1) $$

where v and x are the current velocity and position, and $v_0$ and $x_0$ the
corresponding initial values. In those days of kinematics, we didn’t ask, “Why
does it have a constant acceleration?” We were just told, “It has a constant
acceleration; just find out what happens.” Now that we have learned
dynamics, we know acceleration has a cause, namely some force. So I’m
going to replace a by $F/m$ and make one more cosmetic change in notation.
All the initial (final) quantities will carry a subscript 1 (2). Then Eqn. 5.1
takes the form

$$ v_2^2 = v_1^2 + \frac{2F}{m}d \qquad (5.2) $$

where $d = x - x_0 = x_2 - x_1$ is the distance traveled during this interval.


5.2 The work-energy theorem and power 


Equation 5.2 says that when the force acts on a body, it changes the velocity, 
and the change depends on how far the force has been acting, on how many 
meters it has been pushing the object. The change is not simply in velocity 
but in velocity squared. Let us move everything involving the force to one 
side and the particle to the other side: 


$$ \frac{1}{2}mv_2^2 - \frac{1}{2}mv_1^2 = Fd. \qquad (5.3) $$

The combination $\frac{1}{2}mv^2$ is called kinetic energy and is denoted by K. The
product Fd is called the work done by the force and denoted by W. The
units of W are newton-meters and we replace that by joules or J.

We have found the simplest version of the work-energy theorem:

$$ K_2 - K_1 = Fd = W. \qquad (5.4) $$


What if there are 36 forces acting on the body? Which one should 
I use? Say I’m pulling and you’re pushing. Then, F has got to be the net
force, because Newton’s law connects the net force to the acceleration. If 
you and I have a tug of war, and we cancel each other out to zero, then 
there’s no acceleration, and the body with some initial velocity will main- 
tain its initial velocity. The work-energy theorem says, The change in kinetic 
energy is equal to the work done by all the forces. In this case, there is no 
change in energy, though there are two forces at work that cancel. It turns 
out that it is sensible to define the work done by me, which is equal to 
F times the distance traveled, and the work done by you, which will be 
given an opposite sign. When do we attach a plus sign and when do we 
attach a minus sign? If you go back and review the whole derivation, you 
will see it was understood that a was a positive quantity. Then everything 
works. If the body is moving to the right, and I’m pushing to the right, 
then the work done by me is positive. And if you were pushing to the 
left and the body still moved to the right, the work done by you is negative.
In other words, if you get your way, if things move the way you’re
pushing, the work done by you is positive. If the object is moving counter 
to your will, in the opposite direction to your force, the work done by 
you is negative. I’m lifting this piece of chalk at constant velocity from 
the ground. Its kinetic energy is not changing. That means the total work 
done on the chalk is zero, but not because there are no forces on it. There 
is gravity acting down, and I am countering gravity with exactly mg. The 
work done by me is positive because I want the chalk to go up, and it does. 
If it goes up by an amount h, the work done by me is W = mgh. The 
work done by gravity in the meantime is $-mgh$ and the total work done is
zero.

Let us imagine all of this occurs in a time $\Delta t$ and the distance moved
is $d = \Delta x$. Then if you divide both sides of Eqn. 5.4 by $\Delta t$ and take all the
usual limits

$$ \frac{dK}{dt} = F\frac{dx}{dt} = Fv = P \qquad (5.5) $$


where P is defined as the power. So power is the rate at which work is done. 
For example, if I climb a twelve-story building, I have done some work,
my mg times the height of the building. I can climb the building in one
minute; I can climb the building in one hour. The work done is the same, 
but the power is a measure of how rapidly work is done. That’s why it’s the 
product of force and velocity or work divided by time. The units for power 
are joules per second. That also has a new name, which is watts or just W. 
You may use a kilowatt, or kW, which is a thousand watts. So, if you have 
a 60 W bulb, it’s consuming energy at the rate of 60 joules per second. 
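
To put rough numbers on the building example, here is a minimal Python sketch. The 70 kg climber and the 3 m per story are assumptions chosen only for illustration.

```python
G = 9.8  # m/s^2


def work_to_climb(mass, height, g=G):
    """Work done against gravity in rising by `height`: W = m g h (joules)."""
    return mass * g * height


def average_power(work, seconds):
    """Average power: work divided by the time taken (watts)."""
    return work / seconds


if __name__ == "__main__":
    mass = 70.0          # kg, assumed climber
    height = 12 * 3.0    # m, twelve stories at an assumed 3 m each
    W = work_to_climb(mass, height)
    print(f"work done: {W:.0f} J")
    print(f"power if done in one minute: {average_power(W, 60.0):.1f} W")
    print(f"power if done in one hour:   {average_power(W, 3600.0):.2f} W")
```

The work is the same in both cases; only the rate changes, by a factor of sixty.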

Now let’s turn to the next generalization, when the force is not a con- 
stant but varies with x. Do we know any example of a force that varies with 
x? A spring is a good example, with F(x) = —kx. Even gravity is a good 
example, if we consider large distances. I think it is no secret that the force 
of gravity is mg only near the earth and that if you go sufficiently far, you 
will notice gravity itself is getting weaker. It will still look like $mg'$ locally
but $g'$ won’t be a fixed $9.8\ \mathrm{m/s^2}$; it will be decreasing as we move away from
the earth. 

What is the work-energy theorem when the force varies? Let’s draw 
ourselves a force F(x) that varies with x, as in Figure 5.1. I’m just taking
any function of x that I want. Now the acceleration is not constant because 
the force is not a constant. We cannot apply the formula W = Fd because 
F varies over the distance d. So we resort to the usual trick in calculus: find 
an interval of width dx that is as narrow as you want, so that during that 


Figure 5.1 When a variable force F(x) acts on a body that moves by a distance
dx, it does work dW = F(x)dx shown by the shaded rectangle.

period F is essentially a constant equal to F(x), the value of F at that x. For
that tiny interval I can still say that the change in K is 


$$ dK = F(x)\,dx. \qquad (5.6) $$


Geometrically, F(x)dx is the area of the thin rectangle whose base is
dx and whose height is the function F at that x. (If F(x) < 0, the area is
counted as negative.) If you eventually went from $x_1$ to $x_2$, then the work
done by the force is given by the area under that graph in the sense of
calculus. In every segment you pick up the change in kinetic energy dK;
you add it all up to get $K_2 - K_1$ from the left-hand side. The right-hand
side is the integral of the function F(x) between $x_1$ and $x_2$. The general
work-energy theorem now says

$$ K_2 - K_1 = \int_{x_1}^{x_2} F(x)\,dx = W. \qquad (5.7) $$


Even if you have never heard of an integral, if I give you a function you 
can still deal with this problem. You'll come to me and say, “Give me your 
function. I’m going to plot it on some kind of graph paper with a grid
on it, and I’m just going to count the number of tiny squares enclosed.
That’s the area and that’s the change in kinetic energy.” So integration is
just finding the area bounded by the function at the top, the x axis below,
and two vertical lines at the starting and ending points $x_1$ and $x_2$.

Now here is a little digression, a three-minute introduction to a great 
secret for finding the area. If you give me a function and you tell me to find 
the area under it from $x_1$ to $x_2$, I can show you a trick. You don’t have to
draw anything on graph paper. First, I find a function G(x) specified by its
derivative

$$ F(x) = \frac{dG}{dx}. \qquad (5.8) $$

Then I claim (and will prove shortly) that

$$ \int_{x_1}^{x_2} F(x)\,dx = G(x_2) - G(x_1). \qquad (5.9) $$


This is the opposite of taking derivatives: now we want a function
whose derivative is given to be F. If I say $F(x) = x^3$, then G(x) is that
function whose derivative is $x^3$. Now, I know I have to start with $x^4$ because
when I take derivatives, I will lose a power. But I will also get an unwanted
4 in front. I fight that by putting a 4 downstairs and find $G(x) = \frac{x^4}{4}$. You
might point out that if you have a function whose derivative is something,
adding a constant c doesn’t change the derivative. Then you can say, “Well,
we are in trouble now because the world cannot agree on what G is, because
if I have one G, you can get another one with a different constant.”
But the beauty is that when you take $G(x_2) - G(x_1)$, this difference in the
choice of c goes away. So most of the time we don’t bother with the con- 
stant. Sometimes we make a special choice that recommends itself, as you 
will see when we study gravity on a celestial scale. The choice of constant 
is like the choice of origin in the projectile problem: any origin will do, but 
it is convenient to choose it as the point of take-off. 
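
Both routes to the area can be compared in a few lines of Python. This is only a sketch: the limits 1 and 2, the number of rectangles, and the function names are illustrative choices, with $F(x) = x^3$ and $G(x) = x^4/4 + c$ as in the discussion above.

```python
def area_by_rectangles(F, x1, x2, n=100_000):
    """Add up F(x) dx over n thin rectangles between x1 and x2.

    Each rectangle's height is F evaluated at the middle of its base.
    """
    dx = (x2 - x1) / n
    return sum(F(x1 + (i + 0.5) * dx) for i in range(n)) * dx


def F(x):
    """The example force: F(x) = x^3."""
    return x**3


def G(x, c=0.0):
    """A function whose derivative is F: G(x) = x^4 / 4 + c."""
    return x**4 / 4 + c


if __name__ == "__main__":
    x1, x2 = 1.0, 2.0
    print("counting rectangles:  ", area_by_rectangles(F, x1, x2))
    print("G(x2) - G(x1), c = 0: ", G(x2) - G(x1))
    print("G(x2) - G(x1), c = 7: ", G(x2, c=7.0) - G(x1, c=7.0))
```

The last two lines illustrate why the constant does not matter: it cancels in the difference.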

Why is G(x) the function whose derivative is F(x)? Let us define G(x) as
the area from some arbitrary point $x_0$ to the point x. If I add a little more
area, out to x + dx, the extra area is F(x)dx. This, by definition, is dG, the
change in G. Dividing by dx and taking the limit, we see $F = \frac{dG}{dx}$. How do
different G’s differing by a constant arise? By choosing different starting
points $x_0$ from which to reckon the area. But whatever we choose for $x_0$,
the change in the area due to changing the upper limit x is always given by
dG = F(x)dx.


5.3 Conservation of energy: $K_2 + U_2 = K_1 + U_1$


Let us now incorporate what we have discussed so far and write 


$$ K_2 - K_1 = \int_{x_1}^{x_2} F(x)\,dx = G(x_2) - G(x_1) \equiv G_2 - G_1, \qquad (5.10) $$

which we can rearrange to give

$$ K_2 - G_2 = K_1 - G_1. \qquad (5.11) $$

We now make a little cosmetic change, and introduce the function

$$ U(x) = -G(x), \qquad F(x) = -\frac{dU}{dx}, \qquad (5.12) $$

in terms of which we obtain the standard form

$$ E_2 \equiv K_2 + U_2 = K_1 + U_1 \equiv E_1. \qquad (5.13) $$


This is the law of conservation of energy; E = K + U is called the total me- 
chanical energy and U is called the potential energy. Conservation of energy 
in physics has a totally different meaning from “Turn the lights off when 
you leave the room!” Here it means that when a body is moving under the 
effect of this force F(x), even though it is speeding up and slowing down, 
a certain special combination 


$$ E = \frac{1}{2}mv^2 + U(x) \qquad (5.14) $$


does not change with time. If you know the value of E at one time, you 
know E at all times. 

Let’s consider a simple example. We take a rock, and we drop it. We 
know it’s picking up speed; we know it’s losing height. So, you may ex- 
pect there is some quantity that is a combination of height and speed, 
a combination which does not change in this exchange. We can find 
that combination by this law. In the case of gravity where F = —mg the 
expression for U is 


$$ U = mgy \quad \text{since it obeys} \qquad (5.15) $$
$$ -\frac{dU}{dy} = -mg = F. \qquad (5.16) $$

The energy conservation law takes the form

$$ E_2 = \frac{1}{2}mv_2^2 + mgy_2 = \frac{1}{2}mv_1^2 + mgy_1 = E_1. \qquad (5.17) $$

In the mass and spring system, the corresponding relations are

$$ U(x) = \frac{1}{2}kx^2 \quad \text{since it obeys} \qquad (5.18) $$
$$ -\frac{dU}{dx} = -kx = F \qquad (5.19) $$

and the conservation law assumes the form

$$ E_2 = \frac{1}{2}mv_2^2 + \frac{1}{2}kx_2^2 = \frac{1}{2}mv_1^2 + \frac{1}{2}kx_1^2 = E_1. \qquad (5.20) $$


Let us put that to work. I’m going to pull the mass by an amount A 
and let it go. I want to know how fast it will be moving when it comes to 
some point, say, x = 0. If you go back to Newton’s laws, this is a pretty 
complicated problem. Think about why. You start with a mass at rest. If 
you pull it by an amount A, a force —kA initially acts on it. That will pro- 
duce an acceleration, —kA/m, which will give it a small negative velocity 
by the time it moves a distance $\Delta x$ to the left. But once it comes to the new
location, a different force will be acting on it, because the x is now different
from A. So the acceleration during the next tiny interval is different,
and the gain in velocity during that second interval will be different
from that in the first. You have to add all these changes to find the velocity
at x = 0. That’s a difficult proposition, but with the law of conservation
of energy you’ll do that in no time. Let’s do it, not just for x = 0, but an
arbitrary x. We set $x_1 = A$, $v_1 = 0$ at the start and drop the subscript 2 on
x and v at the generic point to obtain


$$ \frac{1}{2}mv^2 + \frac{1}{2}kx^2 = 0 + \frac{1}{2}kA^2. \qquad (5.21) $$


At the initial time, all the energy is potential energy; there is no ki- 
netic energy because there is no motion. At any subsequent time, we can 
solve for the velocity at any location x. If you want x = 0, that’s very easy: 


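$$ \frac{1}{2}mv^2 + \frac{1}{2}k\cdot 0^2 = 0 + \frac{1}{2}kA^2 \qquad (5.22) $$
$$ v^2 = \frac{kA^2}{m} \qquad (5.23) $$
$$ v = \pm\sqrt{\frac{kA^2}{m}} = \pm A\sqrt{\frac{k}{m}} = \pm\omega A. \qquad (5.24) $$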
m m 


We get two answers because the mass may be going past the origin in either
direction. If it is passing the origin for the first time, it will be moving to
the left with a negative velocity.

For the case of general x, Eqn. 5.21 tells us

$$ v(x) = \pm\sqrt{\frac{k(A^2 - x^2)}{m}}. \qquad (5.25) $$


You can see why finding the velocity is so much easier now; it is because
we have calculated the kinetic energy change produced by the spring
between two points $x_1$ and $x_2$ once and for all by doing the integral of the
spring force and encoding that in $U(x_1) - U(x_2)$.
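
To make the comparison concrete, here is a hedged Python sketch that does the problem both ways: once by following Newton's law in many tiny steps, and once with Eqn. 5.25. The values k = 4 N/m, m = 1 kg, A = 0.5 m, the time step, and the function names are all assumptions for illustration, and the stepping rule (update v, then x) is just one simple choice among many.

```python
import math


def speed_from_energy(k, m, A, x):
    """Magnitude of v at position x from energy conservation, Eqn. 5.25."""
    return math.sqrt(k * (A**2 - x**2) / m)


def velocity_at_origin_stepwise(k, m, A, dt=1e-4):
    """Release the mass at rest from x = A and follow F = -k x in tiny
    time steps until it first reaches x = 0. Returns the velocity there."""
    x, v = A, 0.0
    while x > 0.0:
        a = -k * x / m   # the acceleration changes as x changes
        v += a * dt      # small gain in velocity during this step
        x += v * dt      # small displacement during this step
    return v


if __name__ == "__main__":
    k, m, A = 4.0, 1.0, 0.5   # assumed spring constant, mass, amplitude
    print("energy conservation: |v| =", speed_from_energy(k, m, A, 0.0))
    print("step by step:          v =", velocity_at_origin_stepwise(k, m, A))
```

The stepwise route needs thousands of additions to approach what energy conservation gives in one line, and it comes out negative because the mass first passes the origin moving left.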

Now, let us take another problem: a mass is hanging from the ceiling by a spring.
Let us choose the origin of the vertical coordinate y at the lower end of the 
undistorted spring, so that the force due to the spring is —ky. There will 
now be two kinds of potential energy, the gravitational and spring-based, 
because there are two forces: 


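$$ K_2 - K_1 = \int_{y_1}^{y_2} F(y)\,dy \qquad (5.26) $$
$$ = \int_{y_1}^{y_2} (-mg - ky)\,dy \qquad (5.27) $$
$$ = mg(y_1 - y_2) + \frac{1}{2}k(y_1^2 - y_2^2), \qquad (5.28) $$

which can be written as

$$ K_2 + mgy_2 + \frac{1}{2}ky_2^2 = K_1 + mgy_1 + \frac{1}{2}ky_1^2. \qquad (5.29) $$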


5.4 Friction and the work-energy theorem 


One bad apple ruins the whole thing: friction. Let’s take a body with the 
spring force —kx and a frictional force f acting on it. I will try as usual 
to get a law of conservation of energy when friction is acting, and you 
will see that I will not succeed. It looks like everything here is water- 
tight, right? You tell me all the forces acting on a body. I integrate each 
force from start to finish and call that the difference of the corresponding
potential energies, as I did in the case of a vertical spring-mass system
subject to the spring and gravity. Then I have got my result. So here
we go:

$$ K_2 - K_1 = \int_{x_1}^{x_2} F(x)\,dx = \int_{x_1}^{x_2} (-kx)\,dx + \int_{x_1}^{x_2} f(x)\,dx \qquad (5.30) $$
$$ = \frac{1}{2}kx_1^2 - \frac{1}{2}kx_2^2 + \int_{x_1}^{x_2} f(x)\,dx \qquad (5.31) $$
$$ \left(K_2 + \frac{1}{2}kx_2^2\right) - \left(K_1 + \frac{1}{2}kx_1^2\right) = \int_{x_1}^{x_2} f(x)\,dx \qquad (5.32) $$


If we do the integral over f (x) and call that the difference of potential 
energies due to friction, we can take that to the left-hand side and get a for- 
mula for energy as a sum of K and two potential energies. What is stopping 
me? The answer is that the force of friction is not a function of just x. You 
might say, “What do you mean? I’m pushing this mass, and I know how 
hard it’s pushing me back.” But then I tell you, “Push it the opposite way!” 
Then, you'll find the force of friction, at the very same location, is pointing 
in the opposite direction. This is unlike the force of a spring, which is —kx 
whether you're going toward the origin or away from it. The same is true 
for gravity. Gravity is pulling down with a force $-mg$ and it doesn’t care
whether the object is going up or coming down; at a given location there 
is a fixed force. So the problem is not that the frictional force varies with x 
but that the force is also a function of the direction of velocity, which can 
have either sign. Thus f = f (x, v(x)). Let us postpone the evaluation of the 
integral over f and write 


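$$ \left(K_2 + \frac{1}{2}kx_2^2\right) - \left(K_1 + \frac{1}{2}kx_1^2\right) = \int_{x_1}^{x_2} f(x, v(x))\,dx \qquad (5.33) $$
$$ E_2 - E_1 = \int_{x_1}^{x_2} f(x, v(x))\,dx \qquad (5.34) $$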


I cannot evaluate the integral of f given just $x_1$ and $x_2$, because the mass can
go from $x_1$ to $x_2$ directly or after one or more oscillations. During these
oscillations the velocity will change sign repeatedly and so will the force of
friction at any given x. We must break down the total journey into segments
wherein the velocity and f have a definite sign at each x.

Here is an example where we consider just one segment within which
v has a definite sign and hence so does f. I pull a mass to x = A and let go.
First assume a small amount of friction. I know that when the mass returns
to the center it won’t be going quite as fast as without friction. And I also
know that when it overshoots to the other side, it will not go all the way
back to x = −A. I want to find $A'$, the coordinate of the left-most point.
For this leftward journey from x = A to x = $A'$ the force is a well-defined
function: $f = +\mu_k mg$ pointing to the right. The displacement is negative:
$d = -(A - A')$ and the work done by friction is

$$ W_f = -\mu_k mg(A - A') = -f\cdot(A - A') \qquad (5.35) $$


and the energy equation is 


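$$ E_2 - E_1 = \int_{x_1}^{x_2} f(x, v(x))\,dx = -f\cdot(A - A') \qquad (5.36) $$
$$ \frac{1}{2}kA'^2 - \frac{1}{2}kA^2 = -f\cdot(A - A') \qquad (5.37) $$

where we have set $K_1 = K_2 = 0$ at the start and finish. This quadratic
equation for $A'$ may be rearranged as follows:

$$ (A' - A)\left[\frac{k}{2}(A' + A) - f\right] = 0. \qquad (5.38) $$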


The root A’ = A is trivial: it corresponds to the initial point where the 
kinetic energy is indeed zero and the total energy equals the initial energy. 
Let us analyze the nontrivial solution 


$$ A' = -A + \frac{2f}{k}. \qquad (5.39) $$

As we crank up f from 0 to kA, $A'$ starts moving rightward from x = −A
and turns positive when we pass $\frac{f}{k} = \frac{A}{2}$. That is, the final point now lies at
positive x: the oscillator never even makes it back to the equilibrium point.
Raising f further, we reach the limiting case f = kA when the nontrivial
solution coalesces with the trivial one: the mass never leaves x = A since
the spring force, at its very maximum of kA, exactly equals the frictional
force. (In this discussion we assume $\mu_s = \mu_k$.)

Here is the bottom line for the law of conservation of energy. You take
all the forces acting on a body and equate $K_2 - K_1$ to the work done, the
integral of all the forces. Divide the forces into conservative forces (gravity,
spring) that depend on location only and the non-conservative like friction
(which is the only non-conservative force we will consider). The integral
over each conservative force will turn into an associated potential energy
difference. Let $W_f$ be the work done by friction. The final result will be

$$ E_2 - E_1 = W_f \quad \text{with} \qquad (5.40) $$
$$ E = K + U_1 + U_2 + \cdots $$

where the potential energy $U_1$ is due to the spring, $U_2$ is due to gravity, and
so on.
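
As a check on the bookkeeping with friction, here is a small Python sketch of the spring example with friction (Eqns. 5.37 and 5.39). The values k = 10 N/m and A = 1 m and the function names are assumptions for illustration; f stands for the magnitude of the frictional force.

```python
def left_turning_point(k, f, A):
    """Left-most point A' after release from rest at x = A, Eqn. 5.39.

    Only meaningful for 0 <= f <= k A; beyond that the mass never moves.
    """
    if not 0.0 <= f <= k * A:
        raise ValueError("need 0 <= f <= k*A")
    return -A + 2.0 * f / k


def energy_mismatch(k, f, A, A_prime):
    """E2 - E1 minus the work done by friction, from Eqn. 5.37.

    K is zero at both ends, so E is just the spring potential energy.
    The result is zero when the energy equation is satisfied.
    """
    E2_minus_E1 = 0.5 * k * A_prime**2 - 0.5 * k * A**2
    W_f = -f * (A - A_prime)
    return E2_minus_E1 - W_f


if __name__ == "__main__":
    k, A = 10.0, 1.0   # assumed spring constant (N/m) and initial pull (m)
    for f in (0.0, 2.0, 5.0, 10.0):
        A_prime = left_turning_point(k, f, A)
        print(f"f = {f:4.1f} N gives A' = {A_prime:+.2f} m, "
              f"mismatch = {energy_mismatch(k, f, A, A_prime):.1e}")
```

The sketch reproduces the three regimes: no friction returns the mass to −A, f = kA/2 stops it exactly at the origin, and f = kA leaves it where it started.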


CHAPTER 6 


Conservation of Energy in d = 2 


6.1 Calculus review 


We begin with some mathematical preparation for what I’m going to do
next. Let’s take some function f(x) shown in Figure 6.1. I start at some
point x with a value f(x). When I go to a nearby point, $x + \Delta x$, the function
changes by $\Delta f = f(x + \Delta x) - f(x)$. All these tiny quantities are exaggerated
in the figure so you can see them. We are going to need approximations to
the change in the function as $\Delta x \to 0$. A common one is to pretend the
function is linear with the local value of the slope $f'(x) = \frac{df}{dx}$ as depicted
by the dotted line. The change in f along the straight line is $f'(x)\Delta x$. It
differs from the actual $\Delta f$ by a tiny amount because the function is not
following the same slope that you have to begin with; it’s curving up.

Figure 6.1 The change, $\Delta f$, in a function f(x) as x changes by $\Delta x$, may be
approximated by $\Delta f \approx f'(x)\Delta x$, where $f'(x) = \frac{df}{dx}$. The solid line is the actual
function and the dotted line is the approximation by a straight line of slope $f'(x)$.

Take a concrete example:

$$ f(x) = x^2 \qquad (6.1) $$
$$ f(x + \Delta x) = x^2 + 2x\Delta x + (\Delta x)^2 \qquad (6.2) $$
$$ \Delta f = 2x\Delta x + (\Delta x)^2 \qquad (6.3) $$
$$ \Delta f = f'(x)\Delta x + (\Delta x)^2. \qquad (6.4) $$


This result is valid for $\Delta x$ of any size. We see that the exact change is
$f'(x)\Delta x$ plus something quadratic in $\Delta x$. If we are interested in very small
$\Delta x$, we may start ignoring all but the term linear in $\Delta x$:

$$ \Delta f = f'(x)\Delta x + O(\Delta x)^2 \qquad (6.5) $$

where $O(\Delta x)^2$ signifies that the neglected terms are of order $(\Delta x)^2$ and
higher.
Often we will use

$$ \Delta f \approx f'(x)\Delta x \qquad (6.6) $$

as an approximation for small $\Delta x$.

Consider, for example, $f(x) = (1 + x)^n$ and its values near x = 0.
Clearly f(0) = 1. Suppose you want the function at a point x very close to
the origin. In this case $\Delta x = x - 0$ is just x itself and the approximate value
will be

$$ f(x) = f(0) + f'(0)x + \ldots = 1 + n(1 + x)^{n-1}\Big|_{x=0}\,x + \ldots = 1 + nx + \ldots, \qquad (6.7) $$


a result we will exploit mercilessly. 
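
A quick numerical look at Eqn. 6.7, as a sketch: the exponent n = 1/2 and the sample values of x are arbitrary illustrative choices.

```python
def exact(x, n):
    """The function itself: (1 + x)^n."""
    return (1.0 + x) ** n


def linear(x, n):
    """The approximation of Eqn. 6.7, keeping only the term linear in x."""
    return 1.0 + n * x


if __name__ == "__main__":
    n = 0.5   # assumed exponent, i.e. a square root
    for x in (0.1, 0.01, 0.001):
        error = exact(x, n) - linear(x, n)
        print(f"x = {x:<6} exact = {exact(x, n):.8f} "
              f"approx = {linear(x, n):.8f} error = {error:.2e}")
```

Each time x shrinks by a factor of 10, the error shrinks by roughly a factor of 100, as expected of neglected terms of order $x^2$.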


On other occasions, we will take the limit $\Delta x \to 0$ in the end and
write the equality

$$ df = f'(x)\,dx \qquad (6.8) $$

with the understanding that both sides are to be integrated to obtain

$$ \int_1^2 df = f(x_2) - f(x_1) = \int_{x_1}^{x_2} f'(x)\,dx. \qquad (6.9) $$


6.2 Work done in d=2 


Now we are going to derive the work-energy theorem and the law of con- 
servation of energy in two dimensions. I am hoping I will get some relation 
like $K_1 + U_1 = K_2 + U_2$, where U = U(x, y). How do you visualize the function
of two variables f(x, y)? On top of each point (x, y) you measure the
value of f(x, y) in the third perpendicular direction. The function defines a
surface over the x-y plane and the distance from the plane to the surface
is the value of f at the point (x, y). For example, (x, y) could be coordinates 
of a point in the United States and the function could be the temperature 
T(x, y) at that point. So you plot on top of each point in the United States 
the local temperature. 

Once I have got the notion of a function of two variables, I want to 
move around the plane and ask how the function changes. But now I have 
an infinite number of options. I can move along x, I can move along y, I 
can move at some intermediate angle. Consider derivatives along the two 
principal directions x and y. We're going to define a partial derivative as 
follows. You start at the point (x, y), go to the point $(x + \Delta x, y)$, subtract
the function at the starting point, divide by $\Delta x$, and take $\Delta x \to 0$. This
defines the partial derivative with respect to x:

$$ \frac{\partial f}{\partial x} = \lim_{\Delta x \to 0} \frac{f(x + \Delta x, y) - f(x, y)}{\Delta x}. \qquad (6.10) $$

The curly $\partial$ instead of d tells you it’s the partial derivative. As you move
horizontally, you notice you don’t do anything to y. We could make it very
explicit by using a subscript y as follows:

$$ \left.\frac{\partial f}{\partial x}\right|_y = \lim_{\Delta x \to 0} \frac{f(x + \Delta x, y) - f(x, y)}{\Delta x}. \qquad (6.11) $$

We will not do that: if one coordinate is being varied, all the others (of
which there is just one in d = 2) will be assumed fixed. In the same notation

$$ \left.\frac{\partial f}{\partial y}\right|_x \equiv \frac{\partial f}{\partial y} = \lim_{\Delta y \to 0} \frac{f(x, y + \Delta y) - f(x, y)}{\Delta y}. \qquad (6.12) $$


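Let’s get some practice with $f = x^3y^2$. To find $\frac{\partial f}{\partial x}$ we see how f varies
with x keeping y constant. That means we treat y like a number such as 5
when we encounter it. So we have

$$ \left.\frac{\partial f}{\partial x}\right|_y = 3x^2y^2 \qquad (6.13) $$
$$ \left.\frac{\partial f}{\partial y}\right|_x = 2x^3y. \qquad (6.14) $$

We know from the calculus of one variable that you can take the derivative
of the derivative. Here are the four possible second derivatives and their
explicit values for $f = x^3y^2$:

$$ \frac{\partial}{\partial x}\left(\frac{\partial f}{\partial x}\right) \equiv \frac{\partial^2 f}{\partial x^2} = 6xy^2 \qquad (6.15) $$
$$ \frac{\partial}{\partial y}\left(\frac{\partial f}{\partial y}\right) \equiv \frac{\partial^2 f}{\partial y^2} = 2x^3 \qquad (6.16) $$
$$ \frac{\partial}{\partial x}\left(\frac{\partial f}{\partial y}\right) \equiv \frac{\partial^2 f}{\partial x\partial y} = 6x^2y \qquad (6.17) $$
$$ \frac{\partial}{\partial y}\left(\frac{\partial f}{\partial x}\right) \equiv \frac{\partial^2 f}{\partial y\partial x} = 6x^2y. \qquad (6.18) $$

Notice that the mixed or cross derivatives are equal:

$$ \frac{\partial^2 f}{\partial y\partial x} = \frac{\partial^2 f}{\partial x\partial y}. \qquad (6.19) $$

That’s a property of the generic functions that we will encounter. I’d
like to give you a feeling for why that is true. For what follows, bear in mind
that when you make a small displacement in the plane, the change in any
function is approximately

$$ \Delta f \approx \frac{\partial f}{\partial x}\Delta x + \frac{\partial f}{\partial y}\Delta y, \qquad (6.20) $$

which becomes an equality in the limit $\Delta x \to 0$, $\Delta y \to 0$, and $\Delta f \to 0$:

$$ df = \frac{\partial f}{\partial x}dx + \frac{\partial f}{\partial y}dy. \qquad (6.21) $$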


These limits appear naturally when we plan to sum over the infinitesimal 
changes to get the corresponding integrals. 
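
The partial derivatives of $f = x^3y^2$ and the approximation in Eqn. 6.20 can be checked numerically. The sketch below uses small finite differences in place of the limits; the point (2, 3), the step sizes, and the function names are illustrative assumptions.

```python
def f(x, y):
    """The practice function f = x^3 y^2."""
    return x**3 * y**2


def partial_x(g, x, y, h=1e-5):
    """Approximate dg/dx at fixed y with a small symmetric step in x."""
    return (g(x + h, y) - g(x - h, y)) / (2 * h)


def partial_y(g, x, y, h=1e-5):
    """Approximate dg/dy at fixed x with a small symmetric step in y."""
    return (g(x, y + h) - g(x, y - h)) / (2 * h)


def mixed_xy(g, x, y, h=1e-3):
    """x-derivative of the y-derivative."""
    return (partial_y(g, x + h, y, h) - partial_y(g, x - h, y, h)) / (2 * h)


def mixed_yx(g, x, y, h=1e-3):
    """y-derivative of the x-derivative."""
    return (partial_x(g, x, y + h, h) - partial_x(g, x, y - h, h)) / (2 * h)


if __name__ == "__main__":
    x, y = 2.0, 3.0   # assumed sample point
    print("df/dx:", partial_x(f, x, y), " formula 3x^2y^2 =", 3 * x**2 * y**2)
    print("df/dy:", partial_y(f, x, y), " formula 2x^3y   =", 2 * x**3 * y)
    print("mixed:", mixed_xy(f, x, y), mixed_yx(f, x, y),
          " formula 6x^2y =", 6 * x**2 * y)
    dx, dy = 1e-3, 2e-3
    actual = f(x + dx, y + dy) - f(x, y)
    estimate = partial_x(f, x, y) * dx + partial_y(f, x, y) * dy
    print("actual change:", actual, " linear estimate:", estimate)
```

The actual change and the linear estimate differ only by terms that are quadratic in the small displacements.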

Armed with this, let us ask how much the function changes when we 
go from some point (x, y) to (x + dx, y + dy) in Figure 6.2. We’re going to
make the move in two stages. We go via an intermediate point (x + dx, y)
and add the changes $df_1$ and $df_2$ in each step:

$$ df_1 = \left.\frac{\partial f}{\partial x}\right|_{(x,y)} dx \qquad (6.22) $$
$$ df_2 = \left.\frac{\partial f}{\partial y}\right|_{(x+dx,y)} dy \qquad (6.23) $$
$$ df = \left.\frac{\partial f}{\partial x}\right|_{(x,y)} dx + \left.\frac{\partial f}{\partial y}\right|_{(x+dx,y)} dy. \qquad (6.24) $$

[Figure: a rectangle with corners (x, y), (x + dx, y), (x, y + dy), and (x + dx, y + dy).]

Figure 6.2 Two ways to go from (x, y) to (x + dx, y + dy): move horizontally
and then vertically or vice versa. That the change in f must be the same both
ways becomes the requirement that $\frac{\partial^2 f}{\partial y\partial x} = \frac{\partial^2 f}{\partial x\partial y}$.

Notice that the second step requires the y partial derivative at (x + dx, y). 
Because the partial derivative is itself just another function of x and y, we 
may write to leading order in dx 


$$ \left.\frac{\partial f}{\partial y}\right|_{(x+dx,y)} = \left.\frac{\partial f}{\partial y}\right|_{(x,y)} + \left.\frac{\partial^2 f}{\partial x\partial y}\right|_{(x,y)} dx. \qquad (6.25) $$

Upon feeding this into Eqn. 6.24 we find 


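$$ df = \left.\frac{\partial f}{\partial x}\right|_{(x,y)} dx + \left.\frac{\partial f}{\partial y}\right|_{(x,y)} dy + \left.\frac{\partial^2 f}{\partial x\partial y}\right|_{(x,y)} dx\,dy. \qquad (6.26) $$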

If we first moved up to (x, y + dy) and then to (x + dx, y + dy), we would 
get a change in f with x and y interchanged. Equating the results from the 
two ways to find the change in f between (x, y) and (x+ dx, y+ dy) we find 


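$$ \left.\frac{\partial^2 f}{\partial x\partial y}\right|_{(x,y)} dx\,dy = \left.\frac{\partial^2 f}{\partial y\partial x}\right|_{(x,y)} dy\,dx. \qquad (6.27) $$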

Canceling the products of the infinitesimals, we get the equality of the
mixed derivatives.


6.3 Work done in d = 2 and the dot product


Let us come back to deriving the law of conservation of energy in two
dimensions. In one dimension we found that if $K = \frac{1}{2}mv^2$,

$$ \frac{dK}{dt} = mv\frac{dv}{dt} = mva = Fv = F\frac{dx}{dt} \qquad (6.28) $$
$$ dK = F\,dx \quad \text{upon canceling } dt \text{ above} \qquad (6.29) $$
$$ K_2 - K_1 = \int_{x_1}^{x_2} F(x)\,dx \quad \text{upon integrating both sides above} \qquad (6.30) $$
$$ = U(x_1) - U(x_2), \quad \text{which can be rearranged to give} \qquad (6.31) $$
$$ K_2 + U_2 = K_1 + U_1 \qquad (6.32) $$


provided F did not depend on anything else besides x, such as v(x). 

We want to try the same thing in two dimensions. What expression 
should I use for the work done in two dimensions, given that the force and 
displacement are both vectors with two components each? How should I 
multiply all these par in generalizing dW = Fdx? Here is the solution. 
I’m going to find $ K for a body moving in two dimensions and call that 
the power P = an ne as in d = 1. For that I need a formula for kinetic 
energy. The obvious choice that reduces to what we know is correct for 
motion along just x or y is 


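$$K = \frac{1}{2}mv^2 = \frac{1}{2}m\left(v_x^2 + v_y^2\right). \tag{6.33}$$

Now we find

$$\frac{dK}{dt} = m\left(v_x\frac{dv_x}{dt} + v_y\frac{dv_y}{dt}\right) \tag{6.34}$$

$$\phantom{\frac{dK}{dt}} = F_x v_x + F_y v_y \tag{6.35}$$

$$dK = F_x\,dx + F_y\,dy \tag{6.36}$$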


where I have used Newton's second law $\mathbf{F} = m\frac{d\mathbf{v}}{dt}$ and multiplied both sides
of Eqn. 6.35 by dt, which is allowed in the sense explained earlier. If I define
the work done as

$$dW = F_x\,dx + F_y\,dy, \tag{6.37}$$

I find, just as in d = 1, that

$$dK = dW = F_x\,dx + F_y\,dy. \tag{6.38}$$

The force and displacement are both vectors

$$\mathbf{F} = \mathbf{i}F_x + \mathbf{j}F_y \tag{6.39}$$

$$d\mathbf{r} = \mathbf{i}\,dx + \mathbf{j}\,dy \tag{6.40}$$

and their components enter dW in the combination $dW = F_x\,dx + F_y\,dy$.


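Likewise the power P may be written as

$$P = \frac{dK}{dt} = F_x v_x + F_y v_y. \tag{6.41}$$

Given two vectors

$$\mathbf{A} = \mathbf{i}A_x + \mathbf{j}A_y \tag{6.42}$$

$$\mathbf{B} = \mathbf{i}B_x + \mathbf{j}B_y, \tag{6.43}$$

we see that the combination $A_xB_x + A_yB_y$ appears very naturally. It has a
name: the dot product of A and B, denoted by $\mathbf{A} \cdot \mathbf{B}$. That is, by definition,

$$\mathbf{A} \cdot \mathbf{B} = A_xB_x + A_yB_y. \tag{6.44}$$

In this notation

$$dW = \mathbf{F} \cdot d\mathbf{r} \tag{6.45}$$

$$P = \mathbf{F} \cdot \mathbf{v}. \tag{6.46}$$

A few factoids about $\mathbf{A} \cdot \mathbf{B}$. First

$$\mathbf{A} \cdot \mathbf{A} = A_x^2 + A_y^2 = A^2 \tag{6.47}$$

where A is the length of $\mathbf{A}$.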
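Next, if $\theta_A$ and $\theta_B$ are the angles A and B make with the x-axis, then

$$\mathbf{A} \cdot \mathbf{B} = A_xB_x + A_yB_y \tag{6.48}$$
$$\phantom{\mathbf{A} \cdot \mathbf{B}} = A\cos\theta_A\,B\cos\theta_B + A\sin\theta_A\,B\sin\theta_B \tag{6.49}$$
$$\phantom{\mathbf{A} \cdot \mathbf{B}} = AB\left[\cos\theta_A\cos\theta_B + \sin\theta_A\sin\theta_B\right] \tag{6.50}$$
$$\phantom{\mathbf{A} \cdot \mathbf{B}} = AB\cos\left[\theta_B - \theta_A\right] = AB\cos\left[\theta_A - \theta_B\right], \tag{6.51}$$

which is usually written more compactly as

$$\mathbf{A} \cdot \mathbf{B} = AB\cos\theta, \tag{6.52}$$

where it is understood that $\theta$ is the angle between the vectors. It can be
measured from A to B or the other way since $\cos\theta$ is unaffected by a sign
change in $\theta$.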

Equation 6.52 works even in d = 3 because A and B can still be made
to lie in a plane and $\theta$ defined as the angle between them in this plane.
However, in terms of components we must bring in all three components:

$$\mathbf{A} \cdot \mathbf{B} = A_xB_x + A_yB_y + A_zB_z, \tag{6.53}$$

a result that seems reasonable and one which can be verified after some
messy trigonometry.
The dot product is symmetric since $\cos\theta = \cos(-\theta)$:

$$\mathbf{A} \cdot \mathbf{B} = \mathbf{B} \cdot \mathbf{A}. \tag{6.54}$$

Note that if we set A = B, then $\mathbf{A} \cdot \mathbf{A} = AA\cos 0 = A^2$.

The two definitions of the dot product, Eqns. 6.44 and 6.52, are fully
equivalent. If you are thinking in terms of the components, $A_xB_x + A_yB_y$ is
more natural, while if you are thinking in terms of arrows of some lengths
and angles, $AB\cos\theta$ is preferred. Which one you use depends on your
goals.

For example, to establish an important property, that the dot product
is distributive,

$$\mathbf{A} \cdot (\mathbf{B} + \mathbf{C}) = \mathbf{A} \cdot \mathbf{B} + \mathbf{A} \cdot \mathbf{C}, \tag{6.55}$$

it is easier to proceed as follows:

$$\mathbf{A} \cdot (\mathbf{B} + \mathbf{C}) = A_x(B_x + C_x) + A_y(B_y + C_y) \tag{6.56}$$
$$\phantom{\mathbf{A} \cdot (\mathbf{B} + \mathbf{C})} = A_xB_x + A_yB_y + A_xC_x + A_yC_y = \mathbf{A} \cdot \mathbf{B} + \mathbf{A} \cdot \mathbf{C}. \tag{6.57}$$


On the other hand, using $\mathbf{A} \cdot \mathbf{B} = AB\cos\theta$, it is easier to establish the
following very useful results:


• If A and B are parallel, i.e., $\theta = 0$, their dot product is a maximum.

• If A and B are perpendicular, their dot product is zero.

• Under a rotation of axes, $\mathbf{A} \cdot \mathbf{B}$ is invariant or unchanged, because the
lengths and the relative angle are unchanged by a rotation of axes.


Of course, for every proof with one definition, a possibly more cumber- 
some one, which uses the other definition, also exists. 
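
The statements above about the dot product can be tried out on numbers. The following is a small Python sketch, not from the original text. The vectors A = (3, 4) and B = (-2, 1), the rotation angle 0.6 rad, and all function names are arbitrary choices for illustration. Rotating both vectors by the same angle is used here as a stand-in for rotating the axes the opposite way, since both operations change the components in the same manner.

```python
import math

def dot(a, b):
    """Component definition (Eqn. 6.44): A.B = Ax*Bx + Ay*By."""
    return a[0] * b[0] + a[1] * b[1]

def length(a):
    return math.sqrt(dot(a, a))

def angle_with_x_axis(a):
    return math.atan2(a[1], a[0])

def rotated(a, phi):
    """Rotate the vector a counterclockwise by the angle phi (radians)."""
    c, s = math.cos(phi), math.sin(phi)
    return (c * a[0] - s * a[1], s * a[0] + c * a[1])

A = (3.0, 4.0)
B = (-2.0, 1.0)

by_components = dot(A, B)
theta = angle_with_x_axis(B) - angle_with_x_axis(A)
by_angle = length(A) * length(B) * math.cos(theta)   # Eqn. 6.52

perpendicular = dot(A, (-4.0, 3.0))   # (-4, 3) is A turned by 90 degrees
after_rotation = dot(rotated(A, 0.6), rotated(B, 0.6))

print(by_components, by_angle, perpendicular, after_rotation)
```

The first, second, and fourth numbers should coincide up to rounding, and the third should vanish.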

Let us return to the work-energy theorem using the dot product
notation:

$$dK = \mathbf{F} \cdot d\mathbf{r} = dW. \tag{6.58}$$

The work done by a force when it moves a body by a vector dr is the length
of the force vector times the distance traveled, times the cosine of the angle
between the force vector and the displacement vector. That is also the
change in kinetic energy dK. Let us make a big trip in the x-y plane,
shown in Figure 6.3, starting from a point $\mathbf{r}_1 \equiv 1$ and ending at $\mathbf{r}_2 \equiv 2$, and
made up of a sequence of little segments dr in each one of which I calculate
$\mathbf{F} \cdot d\mathbf{r}$. When I add their contributions to the change in K and the work
done, I get, as the segments' sizes tend to zero,

$$\int_1^2 dK = K_2 - K_1 = \int_1^2 \mathbf{F} \cdot d\mathbf{r}. \tag{6.59}$$

The right-hand side is called the line integral of the force F between 1 and 2
along a path $P_1$.

Figure 6.3 The line integral of a force between points 1 and 2 along a path $P_1$ is
the sum of dot products $\mathbf{F} \cdot d\mathbf{r}$ over tiny segments that make up the path, in the
limit $d\mathbf{r} \to 0$. Also shown by a dotted line is another path $P_2$ between the same
end points.


6.4 Conservative and non-conservative forces 


Suppose it is true, just like in one dimension, that the line integral of the
force is something that depends only on the end points. Let us call the
answer $U(1) - U(2)$, just like we did in one dimension. I am done, because
then I have

$$K_2 + U_2 = K_1 + U_1. \tag{6.60}$$

To make sure this is correct, I ask the mathematicians a question: “You told
me the integral of F(x) from start to finish is really the difference of another
function G at the upper limit minus G at the lower limit, with G related to
F by $F = \frac{dG}{dx}$. Is there a similar result in d = 2?” Sadly, this is not the case.
What could go wrong? Yes, friction will do it, but let us assume there is no
friction, and that F depends only on r. Can something still be wrong? Well,
let me ask you the following question. Suppose I go from 1 to 2 along path
$P_1$, and another person goes along path $P_2$. Do you think that person will
do the same amount of work, even though the force is now integrated on
a longer path? In two dimensions, there are thousands of ways to go from
one point to another point. Therefore, this integral is not specified by just
the end points; it depends on the entire path, which needs to be specified.
If the work done depends on the path, then the answer cannot be of the
form $U(1) - U(2)$, which depends only on the end points.

I digress to point out that even in d = 1, there are many ways to
go from $x_1$ to $x_2 > x_1$. For example, we can go directly to $x_2$ or we can
overshoot to $x_3$ and swing back to $x_2$. The answer will be the same,
because for every segment between $x_2$ and $x_3$ that makes a contribution F(x)dx,
an equal and opposite contribution exists on the way back to $x_2$, because
F(x) remains the same, and dx changes sign. In this sense, every force F(x)
in d = 1 is conservative. Of course, if it is friction we are talking about, the
two canceling pieces now add, because F = F(x, v(x)) reverses sign along
with dx.

Returning to d = 2, I am going to show that the work done by a
generic force will be path-dependent. To generate a random force, I asked
my class to give me numbers from 1 to 3, and I got the following list: 2, 2,
2, 1, 1, and 2. Using these randomly generated numbers as coefficients and
exponents, I wrote down a force:

$$\mathbf{F}(x,y) = \mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2. \tag{6.61}$$

For example, the $2x^2y^2$ is from the first three 2's chosen by the class.

Is it true for this generic force, essentially picked out of a hat, that the 
work done in going from one point to another depends only on the end 
points, or does it depend in detail on how you go between the end points? 
We will find that the work done along two paths, joining the same two end 
points, will give two different answers. 

Let's find the work done in moving from the origin, (0,0), to the
point (1,1). I will take the two paths shown in Figure 6.4. In one path I go
horizontally until I'm at (1,0), below the point (1,1), and then straight up
to (1,1). In the other path, I'm going straight up to (0,1) and then on
horizontally to (1,1).

Figure 6.4 The line integral of a vector from (0,0) to (1,1) along two paths.

So, let's find the work done when I go the first way. I'm
going to integrate $\mathbf{F} \cdot d\mathbf{r}$ first on the horizontal segment, then on the vertical
segment. On the x-axis, if I move a little bit, I have


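$$d\mathbf{r} = \mathbf{i}\,dx \tag{6.62}$$
$$\mathbf{F}(x,y) = \mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2 = 0 \quad \text{because } y = 0 \text{ on the x-axis} \tag{6.63}$$
$$\mathbf{F} \cdot d\mathbf{r} = 0. \tag{6.64}$$


In other words, the work done in this segment is zero because F itself
vanishes when y = 0. In the vertical segment from (1,0) to (1,1),


$$d\mathbf{r} = \mathbf{j}\,dy \tag{6.65}$$
$$\mathbf{F}(x,y) = \mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2 = \mathbf{i}\,2y^2 + \mathbf{j}\,y^2 \quad \text{because } x = 1 \text{ on this segment} \tag{6.66}$$
$$\mathbf{F} \cdot d\mathbf{r} = y^2\,dy \tag{6.67}$$
$$\int \mathbf{F} \cdot d\mathbf{r} = \int_0^1 y^2\,dy = \frac{1}{3}. \tag{6.68}$$


So the work done on this path is $W_1 = 0 + \frac{1}{3} = \frac{1}{3}$.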
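On the second path, we have no contribution from the vertical segment
because F = 0 for x = 0. In the horizontal segment at y = 1, we
have


$$d\mathbf{r} = \mathbf{i}\,dx \tag{6.69}$$
$$\mathbf{F}(x,y) = \mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2 = \mathbf{i}\,2x^2 + \mathbf{j}\,x \quad \text{because } y = 1 \text{ on this segment} \tag{6.70}$$
$$\mathbf{F} \cdot d\mathbf{r} = 2x^2\,dx \tag{6.71}$$
$$\int \mathbf{F} \cdot d\mathbf{r} = \int_0^1 2x^2\,dx = \frac{2}{3}. \tag{6.72}$$


So the work done on this path is $W_2 = 0 + \frac{2}{3} = \frac{2}{3}$.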

The answer is path-dependent. 
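
The two line integrals can also be approximated by brute force, adding up F · dr over many tiny segments exactly as in the definition. Here is a minimal Python sketch, not part of the original lecture; the function names, the midpoint rule, and the number of steps are illustrative assumptions.

```python
def force(x, y):
    """The class-generated force of Eqn. 6.61: F = i 2x^2 y^2 + j x y^2."""
    return (2 * x**2 * y**2, x * y**2)

def work_along(points, steps=2000):
    """Sum F . dr over straight segments joining the listed points (midpoint rule)."""
    total = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx = (x1 - x0) / steps
        dy = (y1 - y0) / steps
        for k in range(steps):
            # Evaluate the force at the middle of each little segment.
            xm = x0 + (k + 0.5) * dx
            ym = y0 + (k + 0.5) * dy
            fx, fy = force(xm, ym)
            total += fx * dx + fy * dy
    return total

w1 = work_along([(0, 0), (1, 0), (1, 1)])   # right, then up
w2 = work_along([(0, 0), (0, 1), (1, 1)])   # up, then right
print(w1, w2)
```

The two sums should come out close to 1/3 and 2/3 respectively, in line with the hand calculation.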

I have shown you that if we took a random force, the work done is 
dependent on the path. For this non-conservative force, you cannot define
a potential energy, whereas in one dimension any force other than friction
allowed you to define a potential energy.

Our quest for a conserved energy leads us to search for a conser- 
vative force, a force for which the work done in going from 1 to 2 is 
path-independent. 


6.5 Conservative forces 


At first sight a conservative force looks miraculous. A randomly generated 
force was seen to have a line integral that depended on the path. How can 
the path dependence ever go away? Do conservative forces exist, and, if yes, 
how are we to find them? 

Do not despair. Here is an algorithm that will produce any number 
of conservative forces. 


• Take any function U(x, y).
• The corresponding conservative force is


$$\mathbf{F} = -\mathbf{i}\frac{\partial U}{\partial x} - \mathbf{j}\frac{\partial U}{\partial y}. \tag{6.73}$$


• The potential energy associated with this conservative force will be U
itself.


Here is an example. 


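$$U(x,y) = xy^3 \tag{6.74}$$
$$\frac{\partial U}{\partial x} = y^3 \tag{6.75}$$
$$\frac{\partial U}{\partial y} = 3xy^2 \tag{6.76}$$
$$\mathbf{F} = -\mathbf{i}\,y^3 - \mathbf{j}\,3xy^2. \tag{6.77}$$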
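
The recipe can be mimicked numerically when the derivatives are not easy to do by hand. The following Python sketch is an illustration only, not the author's method: it estimates the two partial derivatives of U by central differences and compares them with Eqn. 6.77. The sample point (2, 1.5), the step size h, and the function names are arbitrary assumptions.

```python
def potential(x, y):
    """Example potential of Eqn. 6.74: U = x y^3."""
    return x * y**3

def force_from(U, x, y, h=1e-6):
    """Return (Fx, Fy) = (-dU/dx, -dU/dy), estimated by central differences."""
    fx = -(U(x + h, y) - U(x - h, y)) / (2 * h)
    fy = -(U(x, y + h) - U(x, y - h)) / (2 * h)
    return fx, fy

x0, y0 = 2.0, 1.5
fx_num, fy_num = force_from(potential, x0, y0)
fx_exact, fy_exact = -y0**3, -3 * x0 * y0**2   # Eqn. 6.77

print(fx_num, fx_exact)
print(fy_num, fy_exact)
```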


Let me prove to you that the recipe works. The change in the function
U, due to a small deviation from (x, y) to (x + dx, y + dy), is

$$dU = \frac{\partial U}{\partial x}dx + \frac{\partial U}{\partial y}dy \tag{6.78}$$

in the limit as all changes go to zero. Writing this in terms of F,

$$dU = -F_x\,dx - F_y\,dy = -\mathbf{F} \cdot d\mathbf{r}. \tag{6.79}$$

Adding all the little pieces and changing the sign of both sides, we get

$$U(1) - U(2) = \int_1^2 \mathbf{F} \cdot d\mathbf{r} = K_2 - K_1, \tag{6.80}$$


which is the law of conservation of energy with U as the potential energy. 
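
As a numerical illustration of Eqn. 6.80, the sketch below integrates the force derived from U = xy^3 along three different routes from (0,0) to (1,1) and compares each result with U(1) − U(2). This is a hedged example, not from the original text: the three paths, the parameter t running from 0 to 1, and the step count are arbitrary assumptions.

```python
import math

def potential(x, y):
    return x * y**3

def force(x, y):
    """Conservative force of Eqn. 6.77, derived from U = x y^3."""
    return (-y**3, -3 * x * y**2)

def line_integral(path, steps=20000):
    """Approximate the integral of F . dr along path(t), 0 <= t <= 1."""
    total = 0.0
    x_prev, y_prev = path(0.0)
    for k in range(1, steps + 1):
        x, y = path(k / steps)
        # Force evaluated at the midpoint of the little segment.
        fx, fy = force(0.5 * (x + x_prev), 0.5 * (y + y_prev))
        total += fx * (x - x_prev) + fy * (y - y_prev)
        x_prev, y_prev = x, y
    return total

paths = {
    "straight line": lambda t: (t, t),
    "parabola y = x^2": lambda t: (t, t * t),
    "wiggly": lambda t: (t + 0.3 * math.sin(2 * math.pi * t), t),
}

expected = potential(0, 0) - potential(1, 1)   # U(1) - U(2)
results = {name: line_integral(p) for name, p in paths.items()}
for name, value in results.items():
    print(name, value, expected)
```

All three integrals should agree with U(1) − U(2) to within the accuracy of the sum, in contrast to what happened with the class-generated force.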

So I cooked up a force such that $\mathbf{F} \cdot d\mathbf{r}$ was a change in a certain
function U. If I add all the $\mathbf{F} \cdot d\mathbf{r}$'s, I'm going to get the change in the function
U from start to finish. We are beginning to see why certain integrals do 
not depend on the path. Here is an analogy. Forget about integrals. Imag- 
ine I am on some hilly terrain. I start at one point, and I walk to another 
point. At every portion of my walk, I keep track of my change in altitude, 
with uphill as positive and downhill as negative. That is like my dU. I add 
them all up. The total height change will be the difference in the heights 
of the end points. Now, you start with me but go on a different path. You 
wander all over the place but finally stop where I stopped. If you kept track 
of how long you walked, it won't be the same as my walk. But if you also
kept track of how many feet you climbed at each step and added them all 
up, you would get the same answer I got. I repeat: if what you were keep- 
ing track of was the height change in a function, then the sum of all the 
height changes will simply be the height at the end minus the height at the 
beginning, independent of the path. Conversely, starting with the height 
function, if you manufacture a force F whose components are its partial 
derivatives, $\mathbf{F} \cdot d\mathbf{r}$ will measure the height change in each segment, and
the line integral will yield the total height change between start and finish, 
independent of the path. 

Consider the line integral of a conservative force on a closed loop, 
that is, when the starting and ending points 1 and 2 in Figure 6.3 coincide. 
Because this represents the change in U between some point and the same 
point, it vanishes for any loop. This is expressed as follows: 


$$\oint \mathbf{F} \cdot d\mathbf{r} = 0 \quad \text{if } \mathbf{F} \text{ is conservative.} \tag{6.81}$$

Are there other ways to manufacture the conservative force? No! One
can show that every conservative force can be obtained by differentiating 
some corresponding U. 

Remember how I went out on a limb with the randomly chosen force 
the class generated and promised I was going to do the integral along two 
paths and get two different answers? What if the force had been a conser- 
vative force? Then I would have been embarrassed, because I would find, 
after all the work, that both paths gave the same answer. So, I had to make 
sure right away that the force was not conservative. How could I tell? I 
asked myself, “Could there be some function U (the negative of) whose x
and y derivatives could equal $\mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2$?” I knew the answer was No
because if I took a y derivative of such a U to get $F_y$, then $F_y$ should have
one less power of y than $F_x$, but in our example the powers of y were the
same in both. I will describe shortly a better way to analyze this question.

While it is true that even one example of path dependence (as illus- 
trated above) is enough to show a force is non-conservative, getting the 
same answer on two or even two thousand paths between any number of 
fixed end points does not mean the force is conservative. It could be ac- 
cidental. Some other path or some other end points may show the force 
is non-conservative. Conversely it could happen that a non-conservative 
force, like the one I just worked with, has the same integral for two partic- 
ular paths joining two particular end points by pure accident. I took that 
gamble and lucked out. 

But if the force is really conservative, how are we to show that? Here 
is the wonderful test I promised. If F is conservative, it must come from a 
U by taking partial derivatives, as per Eqn. 6.73. It follows that 


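$$\frac{\partial F_x}{\partial y} = -\frac{\partial^2 U}{\partial y\,\partial x} \tag{6.82}$$
$$\frac{\partial F_y}{\partial x} = -\frac{\partial^2 U}{\partial x\,\partial y}, \quad \text{which means} \tag{6.83}$$
$$\frac{\partial F_x}{\partial y} = \frac{\partial F_y}{\partial x} \quad \text{because the cross derivatives are equal.} \tag{6.84}$$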


If I give you a force and ask you, “Is it conservative?” you simply see if 


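$$\frac{\partial F_x}{\partial y} = \frac{\partial F_y}{\partial x}. \tag{6.85}$$

If it is, you know the force is conservative; if not, it is not.
The example we considered,

$$\mathbf{F}(x,y) = \mathbf{i}\,2x^2y^2 + \mathbf{j}\,xy^2, \tag{6.86}$$

fails the test:

$$\frac{\partial F_x}{\partial y} = 4x^2y \neq \frac{\partial F_y}{\partial x} = y^2. \tag{6.87}$$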
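
The test of Eqn. 6.85 is easy to try on a computer. The sketch below, which is an illustration and not part of the original text, estimates the difference of the two cross derivatives at a single point for the class-generated force and for the force obtained from U = xy^3. The function names, the point (1, 1), and the step size are assumptions made for the example; a genuine test requires the difference to vanish at every point, not just one.

```python
def cross_derivative_gap(force, x, y, h=1e-5):
    """Return dFx/dy - dFy/dx at (x, y), estimated by central differences."""
    dfx_dy = (force(x, y + h)[0] - force(x, y - h)[0]) / (2 * h)
    dfy_dx = (force(x + h, y)[1] - force(x - h, y)[1]) / (2 * h)
    return dfx_dy - dfy_dx

def class_force(x, y):
    """Eqn. 6.86: F = i 2x^2 y^2 + j x y^2."""
    return (2 * x**2 * y**2, x * y**2)

def gradient_force(x, y):
    """Eqn. 6.77: F = -i y^3 - j 3x y^2, which comes from U = x y^3."""
    return (-y**3, -3 * x * y**2)

gap_class = cross_derivative_gap(class_force, 1.0, 1.0)
gap_gradient = cross_derivative_gap(gradient_force, 1.0, 1.0)
print(gap_class, gap_gradient)
```

At (1, 1) the first gap should be near 4x²y − y² = 3, and the second should be near zero.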


The two most ubiquitous forces, gravitational and electrostatic, are 
conservative. 

For a longer discussion of this topic that fills in many blanks, see my
Basic Training in Mathematics. 


6.6 Application to gravitational potential energy 


Let's take the most popular example: the force of gravity near the surface of
the earth given by $\mathbf{F}_g = -\mathbf{j}mg = m\mathbf{g}$ where $\mathbf{g} = -\mathbf{j}g$. It is conservative
because the x derivative of $F_y$ vanishes, and there is no $F_x$ to differentiate, so
that $\frac{\partial F_y}{\partial x} = \frac{\partial F_x}{\partial y} = 0$. What is the potential U that led to this? You can easily
guess that U = mgy will obey $F_y = -\frac{\partial U}{\partial y}$. You can also have U = mgy + 96,
but we will not add those constants. In the law of conservation of energy,
$K_1 + U_1 = K_2 + U_2$, adding a 96 to the U on both sides doesn't do anything.
You already knew this from our study of motion in one dimension,
and I am pointing out that this is also true in two dimensions.

Consider an application. Figure 6.5 shows a roller-coaster track that
has a wiggly shape. At every x, there's a certain height y(x) and a potential
energy U(x) = mgy(x), which is essentially just the profile of the roller-coaster
track. If a coaster begins at rest at point A at the top, what is its
total energy? It has a potential energy given by the height h, it has no kinetic
energy, and so the total energy is just $E_1 = mgh$. But the total energy
cannot change as the coaster goes up and down. So, you draw a line at
height $E_1$ to represent this total energy. If the coaster is at some point x,
then $U_1(x)$ is its potential energy and the rest of $E_1$ is its kinetic energy
$K_1(x)$ as shown. As it oscillates up and down during its ride, the coaster
gains and loses kinetic and potential energies, which always add up to the
same $E_1$.

Figure 6.5 The roller-coaster ride. The total energy is fixed at $E_1$ or $E_2$ in the
two examples discussed. At every point x, the sum of the potential energy U(x)
and the kinetic energy K(x) equals a constant E. If the energy is $E_2$, the coaster
can only be found between B and C or to the right of D. It is disallowed in the
region CD where the potential energy exceeds the total energy, and K would have
to be negative.


Consider a roller coaster whose total energy is E2. We release it from 
rest at point B. It'll come down, pick up speed, slow down, stop, and turn 
around at C, because, at that point, the potential energy is equal to the total 
energy and there is no room for any kinetic energy. It'll rattle back and 
forth between B and C. If we release it from rest at D, it will have the same
energy $E_2$, and it will coast down to the end of the ride. But it can never go
from C to D because in the region CD it would have more potential energy 
than total energy, and hence negative kinetic energy, which is impossible. 

However, according to the laws of quantum mechanics, a particle with
energy E can disappear from the region BC and tunnel to D with the same 
energy. I use the word tunnel because in classical mechanics, the particle 
cannot cross the potential energy barrier in the interval CD. In quantum 
theory you cannot raise this objection because particles do not move along 
continuous, interpolating trajectories between two observed locations. 

Back to the coaster: We can use energy conservation to find the speed 
at any point along the track. We can use it to determine the minimum 
height H from which the coaster in Figure 4.6 must be released so as to
reach the top of the loop (at a height 2R measured from the ground) at the
minimum requisite speed of $v = \sqrt{Rg}$. We write

$$\frac{1}{2}m\,0^2 + mgH = \frac{1}{2}mRg + mg(2R) \tag{6.88}$$

$$H = \frac{5}{2}R \tag{6.89}$$
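
The same bookkeeping can be done with numbers. The following Python sketch is illustrative only: the loop radius R = 10 m is a hypothetical value, and the helper name `speed_at_height` is an invention for this example. It solves Eqn. 6.88 for H and then confirms that a coaster released from rest at that height arrives at the top of the loop with the minimum speed.

```python
import math

g = 9.8    # m/s^2, the value used in the text
R = 10.0   # m, hypothetical loop radius chosen for illustration

def speed_at_height(release_height, y):
    """Speed at height y after release from rest, from (1/2) v^2 + g y = g H."""
    return math.sqrt(2 * g * (release_height - y))

v_min_top = math.sqrt(R * g)             # minimum speed needed at the top of the loop
H = 2 * R + v_min_top**2 / (2 * g)       # Eqn. 6.88 solved for H (the mass cancels)

print(H / R)                             # should reproduce the factor in Eqn. 6.89
print(speed_at_height(H, 2 * R), v_min_top)
```

The mass does not appear anywhere in the code because it cancels from both sides of Eqn. 6.88.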


A final note. The law of conservation of energy for the coaster as I
stated it is incomplete, because gravity is not the only force acting. There
is $\mathbf{F}_T$, the normal force of the track. Look, if I didn't want to have any force
but gravity, I could take this roller coaster and just push it over the edge of a
cliff. That converts potential to kinetic energy, but the outcome is not going
to be good for the riders. Park designers build a track because they want
the customers to survive the ride and come back for more. So the track
should exist, and the consequent $\mathbf{F}_T$ should be included in computing the
work. Luckily, this normal force does no work: it is perpendicular to the
track, so $\mathbf{F}_T \cdot d\mathbf{r} = 0$ in every portion. So the correct thing to do would be
to say $K_2 - K_1$ is the integral of all the forces, divide them into $\mathbf{F}_T$ due to
the track and $\mathbf{F}_g$ due to gravity, and drop $\mathbf{F}_T$ for the reason mentioned.


CHAPTER 7 


The Kepler Problem 


7.1 Kepler’s laws 


Next we discuss one of the most famous problems involving a conservative
force: celestial motion under the influence of Newtonian gravity. We're going
to make a big leap beyond inclined planes, pulleys, and whatnot; we
are going to understand how planets move around the sun. That's a mega
problem, right? The little m's you put in the equation are not the masses of
a pulley or a block, but the mass of Jupiter or the sun. You're doing something
of cosmological proportions. And you don't need to know too much
more to do that. You're almost there.

The situation was as follows at Newton’s time. Nicolaus Copernicus 
had proposed that the way to think about our solar system is to put the sun 
S at a fixed point as shown in Figure 7.1 and let the planets move around it. 
I have shown just one planet called P. You have to agree that Copernicus’s 
contribution was truly remarkable. First of all, that was an era when it was 
not “publish or perish” but “publish and perish.” It was not a good idea 
to come out and say what you thought about celestial objects. More than 
that, how did Copernicus ever figure this out? Even today, even when I 
know the heliocentric model is correct, when I look around, it doesn’t at 
all look like this to my eyes. I concede that if you looked at our solar system 
from far away, it would look very simple. But what is our vantage point? 
We are sitting on the third planet from the sun, spinning around our own
axis, and going around the sun. We think we are at rest, and everything is
spinning around the opposite way. To deduce from that chaos this simple 
picture was quite remarkable. This shift in thinking is properly called the 
Copernican revolution. 

After the Copernican revolution, people decided to take the data we 
get with the earth as the center, transcribe the data with the sun as the cen- 
ter, and analyze the new results. Tycho Brahe was a Danish aristocrat who 
had his own lab and studied the solar system. Johannes Kepler, a mathe- 
matician, was his assistant who worked on this problem for forty years and 
then published his findings in the form of three profound laws that were 
well worth the wait. So what if Kepler gave us just three laws after forty 
years? It is still better than Congress. 

(I have to tell all of you who are going into science not to wait forty 
years to publish. You're not going to get a job; you are not going to get a 
grant; and if you get a PhD, you will be at retirement age when you gradu- 
ate. In today’s climate such long-term projects are at risk. One of the rare 
counter-examples that comes to my mind is the work by Raymond Davis 
Jr. on neutrinos, which lasted more than thirty years and ultimately helped 
resolve an outstanding puzzle connected to them.) 

Here are Kepler’s three laws, followed by explanations. 


• The planets move in elliptical orbits with the sun S at a focal point.

• As the planet moves, its position vector SP in Figure 7.1 sweeps out
equal areas in equal times, say as between 1 and 2 and between 3 and 4.

• The ratio $T^2/a^3$, where T is the time period and a the semi-major axis of
the ellipse, is the same for all planets.


Let me remind you that an ellipse is the locus of points the sum of
whose distances from the two focal points ($r + r'$ in Figure 7.1) is a constant.
To draw an ellipse, you take a string of length 2a and you nail its
two ends to the focal points S and E using thumbtacks and pass it over the
pencil tip at P as shown in Figure 7.1. Keeping the string taut, you trace
a closed loop, and you have the ellipse whose major axis is 2a and whose
semi-major axis is a. Thus for the ellipse

$$r + r' = 2a. \tag{7.1}$$


Let us understand why the string length 2a is also the major axis, the width
of the ellipse in the x-direction. Imagine the pencil at point A. The length
of the string goes from S to A, and back from A to E. Take the segment AE
that is covered twice, and borrow one cover to match the segment BS, and
you see that the length of the string spans the major axis 2a.

Figure 7.1 The solar system seen from afar, with just one planet P shown. The
orbit is an ellipse with the sun S at one focal point, and a major axis 2a that
obeys $r + r' = 2a$. The position vector sweeps out equal areas in equal times, for
example, between 1 and 2 and between 3 and 4.

If you move the two focal points to a single point, you get the circle 
with radius R = a. (The minor axis 2b is the width in the perpendicular 
direction and will not figure in this elementary treatment.) If the sun is 
at S, what is at the other focal point E? Nothing, as far as anyone can tell, 
though there are rumors Elvis has been sighted there. The labeling in the 
figure acknowledges this very credible possibility. 

Now for Kepler’s second law. Follow any one planet for a fixed period, 
say one week, as it moves from 1 to 2. Measure the area swept out, which is 
the area between the radius vectors $\mathbf{r}_1$, $\mathbf{r}_2$ and the part of the orbit between
1 and 2. Repeat this for any other one-week period, say between 3 and 4,
and you will measure the same area. Indeed, this rate of sweeping area is
equal even for infinitesimal times, which allows us to say

$$\frac{dA}{dt} = \text{constant}. \tag{7.2}$$

Kepler's third law states that

$$\frac{T^2}{a^3} = \text{same for all planets in the solar system} \tag{7.3}$$


where T is the time period and a is the semi-major axis. In other words,
if you plug in T = 365 days and a = 93,000,000 miles for the earth (assuming
it is on a circular orbit, which it nearly is), you get some number:
roughly $2.96 \cdot 10^{-19}\ \mathrm{s^2\,m^{-3}}$. Now compute this ratio for Jupiter, and you get
$3.01 \cdot 10^{-19}\ \mathrm{s^2\,m^{-3}}$. Very impressive!
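
For readers who want to redo this arithmetic, here is a short Python sketch. It is not part of the original lecture. The earth's numbers are the ones quoted above; the values for Jupiter (a period of 11.86 years and a semi-major axis 5.20 times the earth's) are round figures assumed for this illustration, so the Jupiter ratio it prints need not match the one quoted in the text to the last digit.

```python
MILE = 1609.344   # meters per mile
DAY = 86400.0     # seconds per day

def kepler_ratio(period_s, semi_major_m):
    """Return T^2 / a^3 in units of s^2 m^-3."""
    return period_s**2 / semi_major_m**3

# Earth: T = 365 days, a = 93,000,000 miles, as in the text.
earth = kepler_ratio(365 * DAY, 93.0e6 * MILE)

# Jupiter: assumed round values, not taken from the text.
jupiter = kepler_ratio(11.86 * 365 * DAY, 5.20 * 93.0e6 * MILE)

print(earth, jupiter, jupiter / earth)
```

The last number printed is the quantity of interest: Kepler's third law says it should be close to 1.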

Students often ask if Kepler’s laws have corrections. They certainly 
do, like every other law. First of all, planets are not moving just under the 
influence of the sun but also other planets, especially Jupiter. Secondly, the 
Newtonian law of gravitation has been modified by Einstein’s general the- 
ory of relativity. Both these effects prevent the orbit from being closed. 
The major axis slowly rotates with time, and this is called precession of the 
perihelion, the effect being most pronounced for Mercury. After all the
corrections explicable in Newtonian terms, a tiny amount, 43 seconds of arc
per century, remained unexplained. (A second of arc is 1/3600 of a degree.)
In a remarkable feat of human invention, the general theory of relativity 
explained that last discrepancy. 

Kepler’s data were available to Newton. Newton, as you know, was 
sent home from college because there was a plague in Cambridge. He went 
and lived in his old village, contemplating gravitation. Newton had already 
invented F = ma but not the F for the particular case of gravity. He then 
went on to do just that. 


7.2 The law of universal gravity 


Newton built on his earlier insight that you have to associate a force 
with acceleration and not with velocity. If you follow a planet as it moves 
around, thinking it is force that causes velocity, you won't get any definitive 
answer as to what is behind the force. On the other hand, if you calculate 
the acceleration of the planet, you will find that, at every instant, it points 
toward the sun. (This is obvious for the circular orbit.) If all bodies are ac- 
celerating toward the sun, it’s fairly clear the reason for the acceleration is 
the sun. You then postulate a force that’s exerted by the sun on the planet 
that bends it into a circle. Your job is to find that force.

What's the nature of the force? Again, it was Newton who figured
out that the force that bends the planets around the sun is the same as the
force that bends the moon around the earth or makes the apple fall to the
earth. Now, the fact that the moon, orbiting at a radius $R_m$ at a speed $v_m$,
is accelerating toward the earth at a rate

$$a_m = \frac{v_m^2}{R_m} \tag{7.4}$$

is something you have already learned in this course. As for the apple, it
also is accelerating toward the earth at a rate

$$a_a = g \simeq 9.8\ \mathrm{m\,s^{-2}}, \tag{7.5}$$

independent of its mass $m_a$.

Let us guess the formula for the force on the apple, looking at just the
magnitude, the direction being obviously toward the center of the earth.
Near the earth, we all know, the acceleration of every body is the same.
Therefore from $a = \frac{F}{m}$, we deduce that the force of gravity on a falling body
is itself proportional to the mass of the body, so that the m may cancel out.
So we can write for the apple

$$F_a = m_a f(M_e, R_a) \tag{7.6}$$

where the unknown function f depends on $M_e$, the mass of the earth, and
its radius $R_e$, which is also the distance $R_a$ between the apple and the center
of the earth.

What else do we need? The third law says that if there’s a force on the 
apple exerted by the earth, the apple must exert an equal and opposite force 
on the earth. Consider the earth and the apple—but imagine the apple get- 
ting bigger and bigger. The formula is not going to change. Wait until the 
apple is huge compared to the earth. Then you will have to agree that the 
earth is falling toward the apple rather than the other way around. Under 
this exchanged role, the force on the earth must then be proportional to
$M_e$. Thus, for any two bodies of mass m and M, we obtain an expression
for the force compatible with Newton's third law,

$$F = Mm f(R), \tag{7.7}$$

where R is the distance between their centers. We don't know the distance
dependence; we don't know the function f(R).

To find it, let us compare the acceleration of the apple and the moon
due to the pull of the earth:

$$a_a = \frac{M_e m_a}{m_a} f(R_a) \tag{7.8}$$
$$a_m = \frac{M_e m_m}{m_m} f(R_m) \tag{7.9}$$
$$\frac{a_a}{a_m} = \frac{f(R_a)}{f(R_m)}. \tag{7.10}$$

We know $a_a = 9.8\ \mathrm{m\,s^{-2}}$. What about $a_m$? It is given by kinematics to
be $a_m = \frac{v_m^2}{R_m}$. Does anybody know how far the moon is? If you guessed a
million miles, that is not bad; the correct answer is 238,000 miles. If you
make an estimate that's off by a factor of 4 in astrophysics, it's fine, but if
you say $R_m = 1000$ miles, we should have a very long talk. Anyway, let us
round it off and say $R_m = 240{,}000$ miles.

Next, what’s the radius of the earth? You have some idea, right? 
How far is California? Three thousand miles. And how many hours is 
the time difference? Three hours. So that is one hour per thousand miles. 
If you go all the way around the earth and come back, the accumulated 
time difference has to be 24 hours. That means the earth has a
circumference of roughly 24,000 miles. Dividing by $2\pi \approx 6$, we get $R_e \approx 4000$
miles.

I know that we should work with meters and kilometers but, like
the rest of you, once I get on the freeway I'm watching how many miles
per hour I'm driving, not how many meters per second. Nonetheless,
we Americans do use a lot of British units. If you go buy insulation at
Home Depot, it's rated in BTUs per slug per poundal, right? Sometimes
you wonder why we fought the War of Independence if we're still using
those units. Anyway, my brain is split. When I do physics, I use the
metric system. When I shop at Home Depot, I use the Home Depot
units.

Back to finding the acceleration of the moon, $a_m = \frac{v_m^2}{R_m}$. We'll assume
every orbit is a circle; that assumption turns out to be not so bad, even for
planets. We already have $R_m$, which also gives us the length of the orbit
$2\pi R_m$, which it completes in roughly T = 28 days, yielding a velocity
$v_m = \frac{2\pi R_m}{T}$ and an acceleration

$$a_m = \frac{(2\pi R_m/T)^2}{R_m}. \tag{7.11}$$

If you plug in the numbers, you find

$$\frac{a_a}{a_m} = \frac{f(R_a)}{f(R_m)} \simeq 3600. \tag{7.12}$$

Given that $\frac{R_m}{R_a} = \frac{240{,}000}{4000} = 60$, you don't have to be a Newton to figure out
that

$$f(R) \propto \frac{1}{R^2}. \tag{7.13}$$
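
Plugging in the numbers takes only a few lines. The Python sketch below is an illustration, not part of the original lecture; it uses the rounded inputs from the text (240,000 miles, 4000 miles, 28 days, 9.8 m/s²), and the variable names and the mile-to-meter conversion factor are the only additions. With inputs this rough, the two ratios should be expected to agree only approximately.

```python
import math

MILE = 1609.344   # meters per mile
DAY = 86400.0     # seconds per day

R_m = 240_000 * MILE   # distance to the moon, rounded as in the text
R_e = 4_000 * MILE     # radius of the earth, from the time-zone estimate
T = 28 * DAY           # rough orbital period of the moon

v_m = 2 * math.pi * R_m / T   # speed on a circular orbit
a_m = v_m**2 / R_m            # Eqn. 7.11
a_a = 9.8                     # m/s^2, acceleration of the apple

ratio_of_accelerations = a_a / a_m
ratio_of_distances_squared = (R_m / R_e) ** 2

print(a_m, ratio_of_accelerations, ratio_of_distances_squared)
```

The last two numbers are the ones to compare: an inverse-square law predicts that they should be nearly equal.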


Combining this with Eqn. 7.7, we find the great law of universal gravity:

$$F = G\frac{Mm}{R^2} \tag{7.14}$$

$$G = 6.67 \cdot 10^{-11}\ \mathrm{N\,m^2\,kg^{-2}}, \tag{7.15}$$

where G is the universal gravitational constant that balances the units in
Eqn. 7.14 and ensures that the numerical value we obtain at the surface of
the earth reproduces $g = 9.8\ \mathrm{m\,s^{-2}}$.
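
As a consistency check on Eqns. 7.14 and 7.15, one can compute the acceleration at the earth's surface, $g = GM_e/R_e^2$. The Python sketch below is illustrative only: the mass and radius of the earth used here (5.97 × 10²⁴ kg and 6.37 × 10⁶ m) are commonly quoted values assumed for this example and do not appear in the text.

```python
G = 6.67e-11    # N m^2 kg^-2, Eqn. 7.15
M_e = 5.97e24   # kg, assumed mass of the earth (not from the text)
R_e = 6.37e6    # m, assumed radius of the earth (not from the text)

def surface_gravity(mass, radius):
    """Acceleration a = F/m = G M / R^2 of a small body at distance R from the center."""
    return G * mass / radius**2

g_computed = surface_gravity(M_e, R_e)
print(g_computed)   # should land close to the familiar 9.8 m/s^2
```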

In this argument, we are assuming that the distance between the ap- 
ple and the earth is Re, the radius of the earth. Why not use the height of 
the tree from which the apple fell? Because Newton’s formula is actually 
written down for two point-like objects, with an unambiguous distance 
between them. The correct way to handle the earth is to divide it into many 
small pieces and find the force on the apple due to each piece and add, or 
rather, integrate, over their contributions. The result will be that the earth 
acts as if all its mass were concentrated at its center. Newton knew this to be 
true, but he could not prove it to his satisfaction for many years, which is 
why he delayed publication. Even today it is a hard problem in integration. 

Here is another similar result. Suppose you are inside a hollow spher- 
ical shell of some mass M. What force will you feel? It is clear that if you 
are at the center, you will feel no force because for every piece of matter 
in the shell pulling you one way, there is an identical one pulling you the
opposite way. What is not obvious, but true, is that the gravitational force 
will be zero inside the entire shell. Of course, outside the shell the force will 
be that of a point mass M sitting at the center. In summary, for any spher- 
ically symmetric distribution of mass, the force felt by a body at radius r is 
due to all the mass inside a sphere of radius r, acting as a point mass at the 
center, while the mass outside contributes nothing. 

Equation 7.14 is rightly called the law of universal gravitation. It was a 
tremendous leap of faith to believe that the laws that are operative near the 
earth also apply to the moon and beyond. This was the year 1687; people 
believed in witchcraft and harbored all kinds of superstitions. They were 
not thinking in modern scientific terms. They had a lot of illusions about 
what the heavens were made of. To believe they’re made of the same stuff, 
and controlled by the same laws, was far from obvious in those days. 

Newton’s leap of faith has proven extraordinarily prescient: not only 
the law of gravitation but all the laws of physics that we deduce near the 
earth seem to work over the entire universe, not just now, but even in the 
distant past and, we hope, in the future. Indeed, given the long times light 
takes to get to us from far away galaxies and quasars, much of what we see 
in the heavens today happened a long time back, and yet we analyze the 
data using the recently discovered laws. We have sampled a very tiny part 
of the universe, over a tiny period of time, but we apply the laws we deduce 
here and now to the far reaches of the universe and all the way back in time 
to the big bang. We confidently predict the future fate of the universe. It’s 
a great break for us that the laws we find seem to be universal and eternal. 
It need not have been so. 


7.3 Details of the orbits 


I will apply the law of gravity to the simple case where a planet of mass m 
orbits a sun of mass M >>> m so that the latter may be assumed to stay 
put despite the pull of the planet. The origin of coordinates is chosen to be 
at the sun. We have the equation we need, 


$$m\frac{d^2\mathbf{r}}{dt^2} = -\frac{GMm}{r^2}\,\mathbf{e}_r, \tag{7.16}$$


where $\mathbf{e}_r = \mathbf{r}/r$ is a unit vector that points from the sun to the planet.




This is now a problem in calculus, and everything Kepler said should 
come out of its solution. You should find that planets move in elliptical 
orbits. You should find that equal areas are swept out in equal times. You 
should find the square of a time period divided by the cube of the major 
axis is independent of the planet’s mass. 

Even centuries later, I find it takes the class in advanced mechanics 
quite an effort to solve this equation. It is one thing to write it down, an- 
other to actually solve it and get the elliptical orbits. But Newton did all 
that hundreds of years ago. 

Just imagine that you wrote Eqn. 7.16 but couldn’t solve it. You would 
have found the correct law of gravity, but you could never be sure it was 
right, or convince others, because you could not find the consequence of 
your equations. That is not unheard of. Consider the theory of quarks, 
which we believe to be the constituents of protons, neutrons, and so forth. 
We think we know the underlying equations of motion and forces be- 
tween quarks. But we do not yet have a way to show, analytically, that 
the underlying equations imply the phenomenon or particles that we see. 
However, by solving them approximately on big computers, we are fairly 
certain, after years of work, that the equations are correct. For a new the- 
ory to be accepted, its signature consequences must be worked out exactly 
or approximately, and these must agree with experiment to persuasive 
accuracy. 

Returning to our problem: although we have the means to prove the 
orbits are elliptical in general, we are going to specialize to circular orbits. 
You're always allowed in an equation to make an assumption and plug it in 
to see if it works. We are going to assume there is a circular orbit of radius 
r, in which the planet is moving at speed v, and see if it is permitted by the 
laws of motion and gravity. In the radially inward direction, F = ma gives 
us 


$$m\frac{v^2}{r} = \frac{GMm}{r^2}. \tag{7.17}$$


The left-hand side is the effect, the right-hand side is the cause. If you’re
spinning a rock tied to a string in a circle, it has an acceleration toward the
center, and the string is providing the requisite force. Here, it’s the unseen
force of gravity from the sun, reaching out to a planet and pulling it in. So
let’s cancel the m and one power of r from both sides to get a very useful
equation:


$$v^2 r = GM. \tag{7.18}$$


We find we can have a circular orbit of any radius we like, provided 
the speed satisfies this equation. If you want to launch a satellite at some 
radius r, launch it at this speed at that radius. And as long as you satisfy 
the equation, it does not matter if the thing that is orbiting is actually a 
satellite, a space station, or a potato, because the mass of the object has 
dropped out, taking with it the identity of the object. 
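
To see Eqn. 7.18 in action, here is a small Python sketch that computes the speed needed for a circular orbit. The mass and radius of the earth and the 400 km altitude are assumed values chosen for illustration, not numbers from the text, and `circular_orbit_speed` is a hypothetical helper name.

```python
import math

G = 6.67e-11        # N m^2 kg^-2, Eqn. 7.15
M_EARTH = 5.97e24   # kg (assumed value)
R_EARTH = 6.37e6    # m (assumed value, roughly the 4000 miles used earlier)

def circular_orbit_speed(M, r):
    """Speed v that satisfies v^2 r = G M (Eqn. 7.18)."""
    return math.sqrt(G * M / r)

r = R_EARTH + 400e3                 # an orbit 400 km above the surface
v = circular_orbit_speed(M_EARTH, r)
print(f"v = {v / 1000:.2f} km/s at r = {r / 1000:.0f} km")
```

Notice that the mass of the satellite never appears as an argument, which is the point made above about the potato.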

This is all we can get out of Newton’s laws. Let’s go back to see how 
much of Kepler follows. We have shown that the circle, which is a special 
case of the ellipse, is possible. How about equal areas in equal time? It’s 
obviously true in this problem, because by assumption, the planet is going 
at a constant speed on a circle of fixed radius r. So, the only thing left to 
knock off is Kepler’s third law, relating the time period to the size of the 
orbit. 

Let us plug into Eqn. 7.18 the fact that the velocity of the planet is the 
circumference divided by the time period 


$$v = \frac{2\pi r}{T} \tag{7.19}$$

to obtain

$$\frac{4\pi^2 r^3}{T^2} = GM \tag{7.20}$$

$$\frac{T^2}{r^3} = \frac{4\pi^2}{GM}, \tag{7.21}$$


which is Kepler’s third law, since r is the semi-major axis a for a circle, and 
the right-hand side does not depend on the planet, just the sun. 

What did Newton do that went further than Kepler? Kepler said $T^2/a^3$
was a constant for all planets but did not say what the constant was in
terms of anything else. Newton tells us what it is in terms of the mass of
the sun, $\pi$, and G. If we plug in $M = 2\cdot 10^{30}$ kg for the sun, we find $4\pi^2/GM$ is
essentially the same number $\simeq 3\cdot 10^{-19}\ \mathrm{s^2\,m^{-3}}$ that we encountered earlier
in connection with the $T^2/a^3$ data for the earth and Jupiter.
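
The constant in Kepler's third law is easy to evaluate. The sketch below, a minimal illustration rather than anything from the lectures, computes $4\pi^2/GM$ for the sun and compares it with $T^2/r^3$ for the earth's orbit. The orbital radius of 1.5e11 m and the one-year period are assumed round values.

```python
import math

G = 6.67e-11       # N m^2 kg^-2
M_SUN = 2e30       # kg, the value used in the text

kepler_constant = 4 * math.pi**2 / (G * M_SUN)   # T^2 / r^3 in s^2 m^-3
print(f"4 pi^2 / GM   = {kepler_constant:.2e} s^2 m^-3")

# Cross-check against the earth's orbit (assumed round values).
r_earth_orbit = 1.5e11                 # m
T_earth = 365.25 * 24 * 3600.0         # s
observed = T_earth**2 / r_earth_orbit**3
print(f"earth T^2/r^3 = {observed:.2e} s^2 m^-3")
```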

A similar thing happened in atomic physics. A schoolteacher called
Johann Balmer was analyzing the frequencies of light emitted by atoms 
like hydrogen, and he observed that all the frequencies were given by the 
formula 


$$f = R\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right), \tag{7.22}$$


where R is a constant (for a particular atom) and $n_1$ and $n_2 > n_1$ are any
two positive integers. He knew the value of R from the data but not in
terms of anything more fundamental. Bohr then derived a formula of this 
form from making a quantum postulate, and in the process he obtained an 
expression for R in terms of fundamental constants like Planck’s constant, 
the mass, the charge of the electron, and so forth. Balmer did for Bohr 
what Kepler did for Newton, which was to condense the complicated data 
into some simple form so that theorists could have a crack at it. Bohr did 
for atoms what Newton did for gravity, which was to provide the theory 
underlying the phenomenologically observed behavior. 

Now you can do a variety of problems using $\frac{T^2}{a^3} = \frac{4\pi^2}{GM}$, where M is
the mass of the heavy object, the “sun” in the problem. One interesting 
example is the following, depicted in Figure 7.2. It shows the earth, as we 
look down from above the North Pole. I’m at point A and I want to watch 
a tennis game being played at B. I have access to radio waves, but they 
cannot go through the earth and they can only travel in straight lines. The 
solution is to have three satellites forming a nice triangle as shown. Each 
covers a part of the surface of the earth. If B can send the image to satellite 
3, then 3 can send it to 1 along the dotted line, and then 1 can beam it to 
me since I lie within its broadcast cone. If you have three suitably placed 
satellites, they can help you connect any point on the earth to any other 
point in this manner. But the satellites better be where you think they are 
at all times. If they’re constantly moving around, it doesn’t work. So, what 
you really want are geosynchronous satellites. If you look down at the North 
Pole, you see the earth is spinning counterclockwise. To stay on top of a 
fixed point above the earth, these satellites should be rotating around the 
earth once every 24 hours. The only question is, at what altitude should I 
launch them? I put in T = 24 hours in

$$\frac{T^2}{a^3} = \frac{4\pi^2}{GM} \tag{7.23}$$

and get the radius of a = 42,200 km. Once I have the radius, $v = \frac{2\pi a}{T}$
(Eqn. 7.19) will tell me at what velocity they should be launched into orbit.
Naturally, using more than three satellites will provide better television and
cell phone connections.

Figure 7.2 The view as we look down at the North Pole shows three
geosynchronous satellites, each of which hovers over a fixed point on the earth,
takes 24 hours per revolution, and covers a part of the earth as shown. By
communicating with them, and allowing them to communicate with each other,
any point on the earth can communicate with any other. If B can send the image
of a game to satellite 3, it can send it to 1 along the dotted line, and that in turn
can beam it to me at A.
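
The geosynchronous radius can be reproduced in a few lines. This is a sketch under the assumption that the mass of the earth is 5.97e24 kg, a value not quoted in the text, and with a day taken as exactly 24 hours as above.

```python
import math

G = 6.67e-11         # N m^2 kg^-2
M_EARTH = 5.97e24    # kg (assumed value)
T = 24 * 3600.0      # one revolution per day, in seconds

# Solve T^2 / a^3 = 4 pi^2 / (G M) for the orbit radius a (Eqn. 7.23).
a = (G * M_EARTH * T**2 / (4 * math.pi**2)) ** (1 / 3)

# Launch speed from v = 2 pi a / T (Eqn. 7.19).
v = 2 * math.pi * a / T

print(f"a = {a / 1000:.0f} km, v = {v / 1000:.2f} km/s")
```

The radius is measured from the center of the earth, so the altitude above the ground is smaller by one earth radius.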


7.4 Law of conservation of energy far from the earth 


What’s the potential energy I can associate with the gravitational force? We 
have already seen that near the earth 


$$\mathbf{F}_g = -\mathbf{j}\,mg \tag{7.24}$$

is conservative since $\frac{\partial F_x}{\partial y} = \frac{\partial F_y}{\partial x} = 0$. The corresponding potential we have
been using is

$$U = mgy, \tag{7.25}$$


where y is measured vertically up from the ground. With this choice U = 0 
on the ground. 
Now consider the formula for the force valid for all distances: 


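$$\mathbf{F}_g = -\mathbf{e}_r\frac{GMm}{r^2} = -\mathbf{r}\,\frac{GMm}{r^3} = -(\mathbf{i}x + \mathbf{j}y + \mathbf{k}z)\frac{GMm}{(x^2+y^2+z^2)^{3/2}}, \tag{7.26}$$

using the fact that $\mathbf{e}_r$, the unit vector in the radial direction, is just $\mathbf{r}/r$.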


I claim that this force comes from the following potential upon taking
partial derivatives:

$$U = -\frac{GMm}{r}. \tag{7.27}$$

If this were true we should find

$$F_x = -\frac{\partial U}{\partial x} = -\frac{GMm}{r^3}\,x, \tag{7.28}$$


and likewise for y and z. By symmetry, if it works for x, it will work for y 
and z. Consider 


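$$\frac{\partial U}{\partial x} = -GMm\,\frac{\partial}{\partial x}\left(\frac{1}{r}\right) \tag{7.29}$$
$$= \frac{GMm}{r^2}\,\frac{\partial r}{\partial x} \tag{7.30}$$
$$= \frac{GMm}{r^2}\,\frac{\partial}{\partial x}\sqrt{x^2+y^2+z^2} \tag{7.31}$$
$$= \frac{GMm}{r^2}\cdot\frac{1}{2}\cdot\frac{1}{\sqrt{x^2+y^2+z^2}}\cdot 2x \tag{7.32}$$
$$= \frac{GMm}{r^3}\,x = -F_x \quad \text{QED.} \tag{7.33}$$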
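
As in d = 2, if $F_x = -\frac{\partial U}{\partial x}$ and similarly for y and z, it follows that

$$\mathbf{F}\cdot d\mathbf{r} = -\left[\frac{\partial U}{\partial x}dx + \frac{\partial U}{\partial y}dy + \frac{\partial U}{\partial z}dz\right] = -dU \tag{7.34}$$
$$\int_1^2 \mathbf{F}\cdot d\mathbf{r} = U(1) - U(2), \tag{7.35}$$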


that is, the line integral of F is path-independent and U is the correspond- 
ing potential energy. 
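
If you distrust the calculus, you can check the claim numerically. The following Python sketch compares the force formula of Eqn. 7.26 with a finite-difference estimate of $-\partial U/\partial x$, $-\partial U/\partial y$, and $-\partial U/\partial z$ at one sample point. The masses, the sample point, and the step size h are arbitrary illustrative choices.

```python
import math

G, M, m = 6.67e-11, 5.97e24, 1.0   # M and m are illustrative values

def U(x, y, z):
    """Potential energy U = -G M m / r."""
    return -G * M * m / math.sqrt(x * x + y * y + z * z)

def force_formula(x, y, z):
    """Force components from Eqn. 7.26: F = -(G M m / r^3) (x, y, z)."""
    r = math.sqrt(x * x + y * y + z * z)
    k = -G * M * m / r**3
    return (k * x, k * y, k * z)

def force_from_potential(x, y, z, h=100.0):
    """Approximate F = -grad U with central differences of step h (meters)."""
    Fx = -(U(x + h, y, z) - U(x - h, y, z)) / (2 * h)
    Fy = -(U(x, y + h, z) - U(x, y - h, z)) / (2 * h)
    Fz = -(U(x, y, z + h) - U(x, y, z - h)) / (2 * h)
    return (Fx, Fy, Fz)

point = (7.0e6, 2.0e6, -3.0e6)       # meters from the center of the earth
exact = force_formula(*point)
numeric = force_from_potential(*point)
print(exact)
print(numeric)
```

The two printed triples should agree to many digits, which is the numerical counterpart of the QED above.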

Note that g may be written in terms of G, the mass of the earth $M_e$,
and its radius $R_e$, by invoking the law of universal gravity for a body of mass
m at the surface of the earth:

$$F = \frac{GM_e m}{R_e^2} = mg, \quad \text{which means} \tag{7.36}$$

$$g = \frac{GM_e}{R_e^2}. \tag{7.37}$$


If we want to consider motion on a celestial scale we must use the for- 
mula for U that is valid for all distances and use for the conserved energy 
the expression 


$$E = \frac{1}{2}mv^2 - \frac{GMm}{r}. \tag{7.38}$$


7.5 Choosing the constant in U


Since for any r

$$E = \frac{1}{2}mv^2 - \frac{GMm}{r} \tag{7.39}$$


is the exact conserved energy, we may expect that for an object moving at 
a modest height y above the earth, this must reduce to 


$$E = \frac{1}{2}mv^2 + mgy. \tag{7.40}$$

The potential energy in the exact result will equal the approximate one
valid near the earth if

$$-\frac{GMm}{R_e + y} = mgy = \frac{GMm}{R_e^2}\,y \tag{7.41}$$

upon using $g = \frac{GM}{R_e^2}$.

But there is no way this equality can be correct, because the two sides 
have opposite signs! 

The resolution is that when you define a potential U, you are free to 
add a constant. In different schemes you may choose different constants 
without any contradiction, because only the difference $U_1 - U_2$, in which
the constant drops out, is ever invoked in a physical problem. However, if a
direct comparison of U itself in two different schemes is made, there need
not be any agreement. That is what is happening here. The person working
near the earth chooses the potential $U_e$ that vanishes at the surface of the
earth, y = 0, while a person doing celestial mechanics chooses a potential
$U_c$ that vanishes at $r = \infty$. Let c be the difference between the two schemes:

$$U_e(r) = U_c(r) + c. \tag{7.42}$$


To find c, choose a point on the surface of the earth with $r = R_e$ and y = 0
where $U_e$ vanishes:

$$mg\cdot 0 = -\frac{GMm}{R_e} + c \tag{7.43}$$

which means

$$c = \frac{GMm}{R_e} \tag{7.44}$$
$$U_e(r) = U_c(r) + \frac{GMm}{R_e}. \tag{7.45}$$

Thus the exact potential, shifted by a suitable constant to make it vanish at
y = 0, gives for $U_e$ the exact expression

$$U_e(R_e + y) = -\frac{GMm}{R_e + y} + \frac{GMm}{R_e} \tag{7.46}$$
$$= GMm\left(\frac{1}{R_e} - \frac{1}{R_e + y}\right) \tag{7.47}$$
$$= GMm\,\frac{y}{R_e(R_e + y)} \tag{7.48}$$
$$\simeq m\,\frac{GM}{R_e^2}\,y = mgy \tag{7.49}$$

upon ignoring the y compared to $R_e$ in the $R_e + y$ in Eqn. 7.48.
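
How good is the approximation mgy? Here is a short Python sketch, assuming round values for the mass and radius of the earth that are not quoted in the text, which compares the exact expression of Eqn. 7.48 with mgy at several heights.

```python
G = 6.67e-11         # N m^2 kg^-2
M_EARTH = 5.97e24    # kg (assumed value)
R_E = 6.37e6         # m (assumed value)
m = 1.0              # kg, a test mass

g = G * M_EARTH / R_E**2          # Eqn. 7.37

def U_exact(y):
    """Shifted exact potential, Eqn. 7.48."""
    return G * M_EARTH * m * y / (R_E * (R_E + y))

def U_near(y):
    """Near-earth approximation, Eqn. 7.49."""
    return m * g * y

def relative_error(y):
    return (U_near(y) - U_exact(y)) / U_exact(y)

print(f"g = {g:.2f} m/s^2")
for y in (10.0, 1e3, 1e5, 1e6):
    print(f"y = {y:9.0f} m   relative error = {relative_error(y):.2e}")
```

The relative error works out to $y/R_e$, so mgy is excellent for trees and mountains and poor for satellites.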
Finally, consider the total energy of a circular orbit. Recall that for the 
radial direction we have from the second law 


$$m\frac{v^2}{r} = \frac{GMm}{r^2} \quad \text{so that} \tag{7.50}$$

$$mv^2 = \frac{GMm}{r}. \tag{7.51}$$


So the kinetic energy is exactly half the magnitude of the potential energy. 
The total energy is 


$$E = \frac{1}{2}mv^2 - \frac{GMm}{r} \tag{7.52}$$
$$= \frac{GMm}{2r} - \frac{GMm}{r} = -\frac{GMm}{2r} = -K. \tag{7.53}$$


So, for a particle in a circular orbit, the total energy is negative and equals 
half the potential energy or the negative of the kinetic energy. (This assumes 
we are using the potential $U_c$ appropriate for celestial mechanics, which
vanishes at infinity: $U_c(r \to \infty) = 0$.) You will find that even if you solved
for the elliptical orbit, the total energy would be negative. This is a general 
property: an object that is never able to escape the sun’s pull, that keeps 
orbiting it, has negative total energy. Let us see why. A body with negative 
total energy E < 0 can never run off to infinity: at infinity the potential 
energy is zero and the entire energy is kinetic, and that is supposedly negative,
which is impossible. So the assumption that a body of negative energy
escaped to infinity is wrong. 

So, if you see a comet and want to know if it will come back again, 
add the kinetic and potential energies. If it’s positive, it won’t come back; 
if it’s negative, the comet is trapped. Zero is the dividing line, when the 
comet will collapse at the finish line at infinity. (See if you understand why 
you do not need the mass of the comet to perform this test.) 
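
The comet test can be written as a two-line function. In this sketch the energy is divided by the comet's mass, so only the speed and the distance from the sun enter. The function names and the sample speeds are hypothetical, chosen only to illustrate the sign test.

```python
G = 6.67e-11       # N m^2 kg^-2
M_SUN = 2e30       # kg

def energy_per_unit_mass(v, r):
    """E/m = v^2/2 - G M / r, with the potential vanishing at infinity."""
    return 0.5 * v**2 - G * M_SUN / r

def is_bound(v, r):
    """True if the total energy is negative, so the comet must return."""
    return energy_per_unit_mass(v, r) < 0

r = 1.5e11                      # distance from the sun in meters (illustrative)
print(is_bound(30e3, r))        # a comet moving at 30 km/s
print(is_bound(50e3, r))        # a comet moving at 50 km/s
```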

Suppose you start at the surface of the earth and start shooting things 
upward. As you crank up the speed, they will go farther and farther away, 
and beyond some escape velocity $v_e$ they will never come back. What is the
minimum speed for escape? You should ensure that it goes to infinity with 
no kinetic energy to spare. You want it to just manage to get to infinity, 
stagger, and fall down. Well, if it has no kinetic energy at infinity and no 
potential energy at infinity, its total energy is 0. Therefore, by the law of 
conservation of energy, the total energy at launch must be zero: 


$$0 = \frac{1}{2}mv_e^2 - \frac{GMm}{R_e}, \quad \text{which determines } v_e \text{ to be} \tag{7.54}$$

$$v_e = \sqrt{\frac{2GM}{R_e}}. \tag{7.55}$$
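
Plugging numbers into Eqn. 7.55 is a one-liner. The sketch below assumes round values for the mass and radius of the earth, which the text does not quote, and also checks that the total energy per unit mass at launch is zero.

```python
import math

G = 6.67e-11         # N m^2 kg^-2
M_EARTH = 5.97e24    # kg (assumed value)
R_E = 6.37e6         # m (assumed value)

v_escape = math.sqrt(2 * G * M_EARTH / R_E)          # Eqn. 7.55
launch_energy = 0.5 * v_escape**2 - G * M_EARTH / R_E  # E/m at launch, Eqn. 7.54

print(f"v_e = {v_escape / 1000:.1f} km/s")
print(f"E/m at launch = {launch_energy:.3e} J/kg")
```

Comparing with Eqn. 7.18, the escape speed is $\sqrt{2}$ times the speed of a circular orbit at the same radius.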


You can now go ahead and read some cutting-edge articles. For exam- 
ple, you have heard about dark matter, right? Most of the universe seems to 
be made out of stuff we cannot see. You, me, we all add up to a very small 
percentage of the total mass. How do people know there is dark matter if 
we cannot see it? By the use of $v^2R = GM$, which relates the speed of an
orbiting object at radius R to the mass inside the orbit that is pulling it into 
orbit. As you follow the trajectories of objects orbiting around the galac- 
tic center at radius R, the enclosed mass should grow with R, and it should 
stop growing once the orbit size passes the observed radius of the matter in 
the galaxy (except for a few odd balls like the one whose orbit we are fol- 
lowing). But it keeps growing for a considerable distance beyond, telling 
us there is a halo of dark matter. Even dark matter cannot hide its gravi- 
tational effect. Every galaxy seems to have a dark matter halo that extends 
beyond the visible part. 


CHAPTER 8 


Multi-particle Dynamics 


8.1 The two-body problem 


Next we begin our study of the dynamics of more than one body. You 
might think we already did this when we studied the solar system, con- 
sisting of the sun and all the planets. But we considered only one planet, 
and the sun just stood there as a source of the gravitational force. That was 
essentially a one-body problem. 

As usual, let me start with the simplest possible case of two bodies 
moving in one dimension. They have coordinates $x_1$ and $x_2$ and masses
$m_1$ and $m_2$ as shown in Figure 8.1. The first body obeys

$$m_1\frac{d^2x_1}{dt^2} = F_1, \tag{8.1}$$

which I am going to rewrite as

$$m_1\ddot{x}_1 = F_1, \tag{8.2}$$

where the two dots on $x_1$ tell you we are taking two time derivatives, a convenient
notation if you take only a few time derivatives. Divide the forces
on body 1 into two parts:

$$m_1\ddot{x}_1 = F_{12} + F_{1e}, \tag{8.3}$$

where $F_{12}$ is the force on 1 due to 2 and $F_{1e}$ the sum of all external forces
on 1 due to everything else. Similarly for 2, and in the same notation

$$m_2\ddot{x}_2 = F_{21} + F_{2e}. \tag{8.4}$$

Figure 8.1 A system of two bodies in d = 1 at $x_1$ and $x_2$ with their CM at X. Far
to the right is a planet.


The universe has many bodies, and I have just picked these two as part of 
my system and lumped the rest under the label “external.” For example, a 
spring could be mediating a force between the masses, which could also be 
“falling” under the gravitational pull of some large planet far to the right 
on the x-axis. The spring is just a way of transmitting force from one body 
to the other; it is the source of the internal forces $F_{12} = -F_{21}$. The external
force $F_e$ is the gravitational pull of the planet. Now I squash the spring
and let go. The masses will both accelerate to the right under $F_e$ and also
oscillate relative to each other under the elastic force due to the spring, 
which I will refer to as simply spring force. 


8.2 The center of mass 


We are going to manipulate Eqns. 8.3 and 8.4 to get some interesting re- 
sults. Let us add their left-hand sides and equate them to the sum of the 
right-hand sides: 


$$m_1\ddot{x}_1 + m_2\ddot{x}_2 = F_{12} + F_{1e} + F_{21} + F_{2e}. \tag{8.5}$$

I now invoke Newton’s third law,

$$F_{12} = -F_{21}, \tag{8.6}$$

whatever the underlying force: gravity, spring, electrostatic, and so forth.
This whole chapter is about milking this one simple result, this cancellation
of $F_{12}$ and $F_{21}$. Next, I lump all the external forces $F_{1e}$ and $F_{2e}$ into one
external force $F_e$:

$$F_e = F_{1e} + F_{2e}. \tag{8.7}$$

I then have this equation:

$$m_1\ddot{x}_1 + m_2\ddot{x}_2 = F_e. \tag{8.8}$$

I multiply and divide the left-hand side by the total mass

$$M = m_1 + m_2 \tag{8.9}$$

to obtain

$$M\left[\frac{m_1\ddot{x}_1 + m_2\ddot{x}_2}{M}\right] = F_e \tag{8.10}$$

$$M\frac{d^2X}{dt^2} = F_e, \quad \text{where} \tag{8.11}$$

$$X = \frac{m_1x_1 + m_2x_2}{M} \tag{8.12}$$


is called the center-of-mass coordinate or the CM. 

What I have done is correct, but why did I do that? I have introduced 
a fictitious entity, the center of mass. The center of mass has a location X, 
which is a weighted average of $x_1$ and $x_2$:

$$X = \frac{m_1x_1 + m_2x_2}{M} = \frac{m_1}{m_1+m_2}\,x_1 + \frac{m_2}{m_1+m_2}\,x_2. \tag{8.13}$$

If $m_1 = m_2 = m$, then M = 2m and

$$X = \frac{x_1 + x_2}{2} \tag{8.14}$$

lies midway between the particles. If $m_1 > m_2$, X will be closer to $m_1$ and
vice versa. The weighted sum gives a certain coordinate, but there is nothing
there. All the stuff is either at $x_1$ or $x_2$. The center of mass is the location
of a mathematical entity. It’s not a physical entity. But we care, because it 
behaves like a body. After all, if you were shown only Eqn. 8.11, you would 
say, “Well, this guy’s talking about a body of mass M undergoing some ac- 
celeration due to the force Fe.” Thus, the center of mass is a fictitious body, 
whose mass is the total mass of these two particles, and whose acceleration 
is controlled by only the total external force. This is the key. All the internal 
forces have canceled out, and what remains is the external force. 
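
You can watch the internal forces cancel in a simulation. The following Python sketch steps two masses joined by a spring and pulled to the right by a uniform external force per unit mass, then compares the CM with the free-fall prediction $X_0 + \frac{1}{2}g_{\rm ext}t^2$. All the numbers, the simple time-stepping scheme, and the names are assumptions made for the illustration.

```python
m1, m2 = 1.0, 3.0              # masses in kg (illustrative)
k, rest_length = 50.0, 1.0     # spring constant (N/m) and natural length (m)
g_ext = 2.0                    # external pull per unit mass, toward +x (m/s^2)

x1, x2 = 0.0, 0.5              # the spring starts squashed
v1, v2 = 0.0, 0.0
dt, steps = 1e-4, 10_000

def center_of_mass(xa, xb):
    """Weighted average of the two coordinates, Eqn. 8.12."""
    return (m1 * xa + m2 * xb) / (m1 + m2)

X0 = center_of_mass(x1, x2)
for _ in range(steps):
    F12 = k * (x2 - x1 - rest_length)   # force on 1 due to 2 (spring)
    F21 = -F12                          # Newton's third law, Eqn. 8.6
    v1 += (F12 + m1 * g_ext) / m1 * dt  # Eqn. 8.3
    v2 += (F21 + m2 * g_ext) / m2 * dt  # Eqn. 8.4
    x1 += v1 * dt
    x2 += v2 * dt

t = steps * dt
X_simulated = center_of_mass(x1, x2)
X_predicted = X0 + 0.5 * g_ext * t**2   # a point mass M under F_e alone
print(X_simulated, X_predicted)
```

The individual masses oscillate furiously, yet the two printed numbers agree up to the small error of the time-stepping.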

If you have three bodies, you can do a similar manipulation with extra
forces like $F_{23} = -F_{32}$ and so on, and you will end up with Eqn. 8.11
with one obvious change:

$$X = \frac{m_1x_1 + m_2x_2 + m_3x_3}{M} = \frac{m_1x_1 + m_2x_2 + m_3x_3}{m_1 + m_2 + m_3}. \tag{8.15}$$

We write such sums in a compact notation:

$$X = \frac{\sum_{i=1}^{3} m_i x_i}{\sum_{i=1}^{3} m_i}, \tag{8.16}$$

which can be generalized to N particles by replacing the 3 by N. 
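
In code, the sum over N particles is a single expression. This is a minimal sketch with made-up masses and positions, and `center_of_mass` is just an illustrative name.

```python
def center_of_mass(masses, positions):
    """X = sum(m_i x_i) / sum(m_i) for any number of particles."""
    total_mass = sum(masses)
    return sum(m * x for m, x in zip(masses, positions)) / total_mass

masses = [1.0, 2.0, 3.0]        # kg (illustrative)
positions = [0.0, 1.0, 4.0]     # m (illustrative)
X = center_of_mass(masses, positions)
print(X)   # (0 + 2 + 12) / 6
```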

Once more with feeling: the CM responds only to the total external 
force; it doesn’t care about internal forces. I’ll give an example. A couple
of samurai are having a fight in an airplane, punching each other and 
so on. It is a flight and fight situation. The rest of the passengers get fed 
up and throw them out. So, they’re falling down, affecting each other’s 
dynamics. This samurai will feel a force due to that samurai, that samurai 
will feel a force due to this samurai, but the center of mass is still going to 
drop like a rock. It’s going to feel a force $(m_1 + m_2)g$, and it will have an
acceleration g. 

Suppose at some point one falling samurai cuts the other into two 
pieces. So now we have three bodies: the first protagonist and the other 
two, who used to be one. You can take these three bodies, find their center 
of mass, and it will be the same story—the center of mass will just keep 
accelerating at the same g, as if nothing happened. 

Indeed, if you were following the CM alone, you would see no sign 
of all this violence and the involuntary partitioning of the second samurai. 
The system is becoming more and more complicated, but nothing changes 
the dynamics of the center of mass, undergoing free fall under gravity. 

In summary, the CM can accelerate only due to external forces, like 
gravity in this example. If there were no external forces, then the center of 
mass would behave like a free particle. If it was not moving to begin with, it 
won't move later. If it was moving to begin with, it will maintain the initial 
velocity. 

Figure 8.2 The CM vector R lies on $\mathbf{r}_2 - \mathbf{r}_1$, the line joining the tips of $\mathbf{r}_1$ and $\mathbf{r}_2$.


If you’re living in two dimensions, you define a CM vector R for two
particles as follows and as indicated in Figure 8.2:

$$\mathbf{R} = \frac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2}{m_1 + m_2}, \tag{8.17}$$

which is equivalent to two relations

$$X = \frac{m_1x_1 + m_2x_2}{m_1 + m_2} \tag{8.18}$$

$$Y = \frac{m_1y_1 + m_2y_2}{m_1 + m_2}. \tag{8.19}$$


I have shown the CM situated on the line connecting the masses. 
Why? One way to understand this is to choose a new x’-axis that passes 
through the masses and a new y’-axis perpendicular to it. Since the masses 
only have an x’ coordinate, the weighted average must also have only an x’ 
coordinate. 

Here’s another example. You take a complicated collection of masses 
and springs, connected by ropes and chains and whatnot. You throw the 
whole mess into the air. All the different parts of it are jiggling and do- 
ing complicated movements, but if you follow the center of mass, in other 
words, at every instant you compute 


$$\mathbf{R} = \frac{\sum_i m_i\mathbf{r}_i}{\sum_i m_i}, \tag{8.20}$$


it will simply follow the parabolic path of a body curving under gravity. 
If at some point this complicated object fragments into two disconnected 
chunks that fly off on their own paths, the CM will continue as before. 

Here is a very useful alternative for finding the CM of a collection of
masses. 


1. Take a subset of them and replace them with all their mass sitting at 
their center of mass. Replace the rest by their total mass sitting at their 
center of mass. 

2. Find the center of mass of these two centers of mass, properly weighted. 


Let us verify the equivalence of this to the standard definition, for the 
case of three bodies. The original recipe to compute X was 


$$X = \frac{m_1x_1 + m_2x_2 + m_3x_3}{m_1 + m_2 + m_3}. \tag{8.21}$$

Next we follow the new recipe and divide the system into two parts, one
made of masses numbered 1 and 2, of total mass $M_{12} = m_1 + m_2$, and the
third one by itself. We first compute the CM of 1 + 2 following the standard
definition:

$$X_{12} = \frac{m_1x_1 + m_2x_2}{M_{12}} \tag{8.22}$$

and combine it with object 3 and see what happens if we follow the recipe:

$$X_{\text{recipe}} = \frac{m_3x_3 + M_{12}X_{12}}{m_3 + M_{12}} \tag{8.23}$$
$$= \frac{m_3x_3 + m_1x_1 + m_2x_2}{m_1 + m_2 + m_3} \tag{8.24}$$
$$= X. \tag{8.25}$$

Along the way, I have used Eqn. 8.22, which implies that $M_{12}X_{12} = m_1x_1 + m_2x_2$.

Clearly, in a general case, we can subdivide the masses into more than 
two subsets and follow the same procedure. 
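
The equivalence of the two recipes is easy to confirm with numbers. In this sketch the masses and positions are made up, and the grouping into "1 + 2" and "3" mirrors the derivation above.

```python
def center_of_mass(masses, positions):
    """Standard definition: sum(m_i x_i) / sum(m_i)."""
    return sum(m * x for m, x in zip(masses, positions)) / sum(masses)

m1, m2, m3 = 1.0, 2.0, 3.0      # kg (illustrative)
x1, x2, x3 = 0.0, 1.0, 4.0      # m (illustrative)

# Direct computation over all three bodies.
X_direct = center_of_mass([m1, m2, m3], [x1, x2, x3])

# Recipe: first replace bodies 1 and 2 by their total mass at their CM,
# then combine that fictitious body with body 3.
M12 = m1 + m2
X12 = center_of_mass([m1, m2], [x1, x2])
X_recipe = center_of_mass([M12, m3], [X12, x3])

print(X_direct, X_recipe)
```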

As long as you are dealing with a countable number of masses, find- 
ing the CM is just plug and chug, in one or higher dimensions. Things 
become more interesting if I give you an extended body, like a rod of mass 
M and length L, shown in the upper part of Figure 8.3. Where is the center 
of mass? We have to adapt the previous definition for discrete masses. The
trick is to partition the rod into tiny pieces of length dx located a distance
x from the origin, chosen to be at the left end of the rod. The mass of this
segment is $\frac{M}{L}dx$, the product of the mass per unit length and the length of
the sliver. Its location is x.

Figure 8.3 (Top) A rod of length L and mass M. (Bottom) An L-shaped object
made of two rectangles. The points 1 and 2 are the CMs where all their mass may
be assumed to be concentrated. The point marked CM is the weighted sum of 1
and 2.

Now, you might object that the segment extends from x to x+ dx and 
doesn’t have a definite coordinate. This objection is valid for any finite dx, 
but in the end, we will let $dx \to 0$ and the objection will disappear in that
limit. The CM is now given by the ratio of integrals, rather than sums as in 
Eqn. 8.20: 


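$$X = \frac{\int_0^L \frac{M}{L}\,x\,dx}{\int_0^L \frac{M}{L}\,dx} \tag{8.26}$$
$$= \frac{1}{L}\left[\frac{x^2}{2}\right]_0^L \tag{8.27}$$
$$= \frac{L}{2}. \tag{8.28}$$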


So the center of mass of this rod, to nobody’s surprise, is right at the 
midpoint. We knew this before doing the integral. What was behind that 
intuition? If you take the origin at the midpoint of the rod, you can argue 
that for every sliver with coordinate x, you have another equally mas- 
sive one at —x, and that the weighted average of these two points is zero. 
The CM, which is the weighted average of all these zeros, also lies at the
geometric center. 

This symmetry argument applies even for a two-dimensional body, 
like a rectangle. We can argue that its CM is at its geometric center. This 
is because for each tiny square segment of size $dx \cdot dy$ at (x, y) (measured
from the geometric center), we can find an equal one at (−x, −y), and
the weighted average of the two will be at the origin (0,0). The weighted 
average of all these zeros will also be (0,0). 

Now consider a rod whose linear mass density, or mass per unit
length, varies as some function $\rho(x)$. Then by the usual argument

$$X = \frac{\int_0^L \rho(x)\,x\,dx}{\int_0^L \rho(x)\,dx}. \tag{8.29}$$

For example, if $\rho(x) = Ax$, where A is some constant,

$$X = \frac{\int_0^L Ax\cdot x\,dx}{\int_0^L Ax\,dx} \tag{8.30}$$
$$= \frac{L^3/3}{L^2/2} = \frac{2L}{3}, \tag{8.31}$$


which is biased to the right as you would expect. 
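
The ratio of integrals can be mimicked by chopping the rod into many slivers and going back to sums. This Python sketch uses a simple midpoint rule; the function name, the number of slivers, and the sample values of L and A are illustrative assumptions.

```python
def cm_of_rod(density, L, n=100_000):
    """Approximate X = (integral of rho x dx) / (integral of rho dx)
    by summing over n slivers of length dx, each placed at its midpoint."""
    dx = L / n
    weighted_sum = 0.0
    total_mass = 0.0
    for i in range(n):
        x = (i + 0.5) * dx           # midpoint of the sliver
        dm = density(x) * dx         # mass of the sliver
        weighted_sum += x * dm
        total_mass += dm
    return weighted_sum / total_mass

L = 2.0
X_uniform = cm_of_rod(lambda x: 1.0, L)       # constant density
X_linear = cm_of_rod(lambda x: 3.0 * x, L)    # rho(x) = A x with A = 3
print(X_uniform, X_linear)
```

The uniform rod gives L/2 and the rod with $\rho(x) = Ax$ gives 2L/3, independent of the value chosen for A.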

Consider next an L-shaped object shown in the lower part of 
Figure 8.3. Where is its CM? We resort to the trick of first reducing each 
rectangle to its CM and then doing the weighted sum of the two CMs. By 
the symmetry argument, the CM of each rectangle is at its geometric center,
the points numbered 1 and 2. We may imagine their total masses $M_1$
and $M_2$ to be concentrated at these points. The CM is the weighted sum of
these two point masses, and it lies along the line joining them. Its precise 
location is easily found if the masses and dimensions of the rectangles are 
known. 

Now for one final CM calculation, the most difficult one you are sup- 
posed to know. The object is a triangle of mass M, base 2w, and height h, 
as shown in Figure 8.4. It has an area 


$$A = \frac{1}{2}\cdot 2w\cdot h = wh \tag{8.32}$$


Figure 8.4 The CM calculation of a triangle of base 2w and height h. It is 
viewed as a weighted sum over rods of width dx and height 2y(x). 


and a mass per unit area or areal density 


$$\rho = \frac{M}{A} = \frac{M}{wh}. \tag{8.33}$$


Where is the center of mass of this object? Again, by symmetry, you can 
tell that Y, the y coordinate of the center of mass, must be zero. For every 
tiny square dxdy with some coordinate (x, y), there is a matching one with 
coordinate (x, —y). For X, you have to do some honest work. We will divide 
and conquer. 

Let us imagine the triangle as composed of thin rectangles of width 
dx and height 2y(x), as indicated. (Each strip is not quite a rectangle, because
the edges are slightly tapered, but when $dx \to 0$, they will reduce to
rectangles.) The mass dm of the rectangle at a given x is 


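$$dm = \frac{M}{A}\,2y(x)\,dx = \frac{M}{wh}\,2y(x)\,dx, \tag{8.34}$$

which is just the product of the mass per unit area $\frac{M}{A}$ and the area of the
strip 2y(x)dx. We find y(x) using similar triangles:

$$\frac{y(x)}{x} = \frac{w}{h}, \quad \text{which means} \quad y(x) = \frac{wx}{h}. \tag{8.35}$$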

The weighted average of x is then

$$X = \frac{1}{M}\int_0^h x\,dm \tag{8.36}$$
$$= \frac{1}{M}\int_0^h x\,\frac{M}{wh}\,2\,\frac{wx}{h}\,dx \tag{8.37}$$
$$= \frac{2}{h^2}\int_0^h x^2\,dx \tag{8.38}$$
$$= \frac{2}{3}h. \tag{8.39}$$


We could have anticipated that X would be skewed to the right, and 
this formula quantifies that intuition. Note in Eqn. 8.37 that this two- 
dimensional problem maps onto a one-dimensional one, with a linear 
density proportional to x, that is, $\rho(x) \propto x$. This is because each vertical
strip may be replaced by a point mass on the x-axis proportional to y(x), 
which in turn grows linearly with x. 
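
The strip argument translates directly into a sum. In this sketch, which assumes a triangle with its apex at the origin and its base of width 2w at x = h, each strip of height 2y(x) is treated as a point mass on the x-axis. The particular values of w and h are arbitrary.

```python
def triangle_cm_x(w, h, n=100_000):
    """Sum over vertical strips of width dx and height 2 y(x), y(x) = w x / h.
    Returns the x coordinate of the CM and the total area of the strips."""
    dx = h / n
    weighted_sum = 0.0
    total_area = 0.0
    for i in range(n):
        x = (i + 0.5) * dx                    # midpoint of the strip
        strip_area = 2 * (w * x / h) * dx     # area of the strip, 2 y(x) dx
        weighted_sum += x * strip_area
        total_area += strip_area
    return weighted_sum / total_area, total_area

X, area = triangle_cm_x(w=1.5, h=3.0)
print(X, area)
```

The uniform areal density cancels between numerator and denominator, so the result depends only on the shape: X comes out as 2h/3 and the strips add up to the area wh.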

To summarize, when we work with extended bodies or more than one 
body, we can replace the entire body by a single point for certain purposes. 
The single point is called a center of mass or CM. The CM is fictitious. 
It has a mass equal to the total mass. It has a location R that moves in 
response to the total external force: 


$$M\frac{d^2\mathbf{R}}{dt^2} = \mathbf{F}_e. \tag{8.40}$$


The center of mass is not aware of internal forces, and that’s what we want 
to exploit. 

One class of problems has a net external force Fe, and there we 
know that the CM responds as a point to Fe, regardless of its constituents. 
For example, a jumbled mass of constituents tossed in the air follows 
the parabolic trajectory of a point mass, in response to gravity. This 
is just a one-body problem, which we have studied extensively. So we 
move on. 


8.3 Law of conservation of momentum


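Now consider the case $\mathbf{F}_e = 0$. That means

$$\frac{d^2\mathbf{R}}{dt^2} = 0 \tag{8.41}$$
$$\frac{d\mathbf{R}}{dt} = \text{some constant}. \tag{8.42}$$

Let us multiply both sides by the total mass M, which is itself a constant,
to obtain

$$M\frac{d\mathbf{R}}{dt} = \text{some other constant}. \tag{8.43}$$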


What is this constant? This question requires a digression into the 
concept of momentum. The momentum p of a single particle is given by 


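$$\mathbf{p} = m\mathbf{v}. \tag{8.44}$$

We may rewrite F = ma as

$$\mathbf{F} = m\frac{d^2\mathbf{r}}{dt^2} = m\frac{d\mathbf{v}}{dt} = \frac{d(m\mathbf{v})}{dt} = \frac{d\mathbf{p}}{dt}, \tag{8.45}$$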


that is, the force is the rate of change of momentum. 
The CM, though fictitious, is endowed with a well-defined position 


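$$\mathbf{R} = \frac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2}{M} \tag{8.46}$$

and a velocity, given by differentiation of both sides

$$\frac{d\mathbf{R}}{dt} = \frac{m_1\dot{\mathbf{r}}_1 + m_2\dot{\mathbf{r}}_2}{M}. \tag{8.47}$$

Let us define its momentum as we did for a real particle

$$\mathbf{P} = M\frac{d\mathbf{R}}{dt}. \tag{8.48}$$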
Now we find, using Eqn. 8.47, that

$$\begin{aligned}
\mathbf{P} &= m_1\frac{d\mathbf{r}_1}{dt} + m_2\frac{d\mathbf{r}_2}{dt} && \text{(8.49)} \\
&= \mathbf{p}_1 + \mathbf{p}_2. && \text{(8.50)}
\end{aligned}$$

In summary, the CM momentum is the sum of the momenta of all the
particles and only external forces can change it:

$$M\frac{d^2\mathbf{R}}{dt^2} = \frac{d\mathbf{P}}{dt} = \mathbf{F}_e. \tag{8.51}$$

If $\mathbf{F}_e = 0$, the CM momentum $\mathbf{P}$ is conserved. This means the sum
over all the individual momenta is conserved, as long as the particles
interact only with each other, and nothing external.

A classic example is two people standing on ice. Their total initial 
momentum is 0. The ice is going to support them vertically against gravity, 
but if it’s frictionless, it cannot apply any force parallel to the plane of the 
ice. For example, if you and I are standing on ice, and we push against each 
other and fly apart, my momentum vector has to be exactly the opposite 
of your momentum vector. 

Next consider a mass $m_1$ that collides with a mass $m_2$ in the absence
of external forces. Then $\mathbf{P}$, the total initial momentum, equals $\mathbf{P}'$, the total
final momentum:

$$\mathbf{P} = m_1\mathbf{v}_1 + m_2\mathbf{v}_2 = m_1\mathbf{v}_1' + m_2\mathbf{v}_2' = \mathbf{P}'. \tag{8.52}$$

I use unprimed and primed symbols for the initial and final velocities rather than the
labels 1 and 2, which are now used to distinguish the two particles. During
the collision, one mass exerts a force on the other mass, but the other mass 
exerts an opposite force on the first. So the rate of change of momentum 
of one is equal and opposite at every instant to the rate of change of mo- 
mentum of the other. Hence the total momentum is unaffected, even if the 
individual momenta change. 
This is called the law of conservation of momentum. In terms of $\mathbf{p}$,

$$\mathbf{p}_1 + \mathbf{p}_2 = \mathbf{p}_1' + \mathbf{p}_2'. \tag{8.53}$$
Let us be sure we get it. Take a collection of bodies. At a given instant
every one has its own velocity and its momentum. Add up all the
momenta. If you are in one dimension, just add the numbers; if in two 
dimensions, add the vectors to obtain the total momentum. That total 
momentum does not change, if there are no outside forces acting. 

The law of conservation of momentum survived the revolutions of 
relativity and quantum mechanics, although the explicit formula p = mv 
did not. 

So far, we have considered the following two cases: 


• The CM is subject to an external force $\mathbf{F}_e \neq 0$, and it moves in response
like a single particle of mass M, as illustrated by the falling samurai.

• The external force $\mathbf{F}_e = 0$, and the CM has a non-zero velocity or
momentum that is conserved.


Now I finally consider a problem where there are no external forces, and 
the CM is initially at rest and stays put. 

Recall the dynamics of the sun and the earth. My earlier description 
of the sun sitting still and the earth moving around it is not acceptable for 
the following reason. The momentum of the sun, $\mathbf{P}_s$, is not changing, and
it equals 0. The momentum of the earth, $\mathbf{P}_e$, is changing as it orbits the
sun. Their total momentum is therefore changing, which is not allowed, 
when the only force is their mutual attraction. Stated differently, what is 
wrong with our earlier description is that the center of mass of the sun and 
earth does not move uniformly in a straight line, or remain at rest, as it 
should. Instead, as the earth orbits the fixed sun, the CM, which lies on 
the line joining the sun and the earth, goes around the sun as well. The 
CM of the earth and sun should not accelerate. If it was initially at rest, it 
should remain at rest. If it had an initial velocity, it should maintain that 
velocity, and we can choose to view it from a frame in which it is at rest. 
(Going to a frame at a fixed relative velocity to our initial one will not alter 
our status as inertial observers.) Figure 8.5 portrays the correct description. 
I’m considering a solar system where the sun is not as big as ours relative to 
the planets, so I can show the center of mass distinctly. (Our sun is so much 
more massive than the average planet that the CM typically lies inside the 
sun.) When the two bodies are at 1 and 2, the CM is where it is as shown, 
and it will be there later when they are at 3 and 4. So both the bodies will 
revolve around their common CM, which stays put. 
Figure 8.5 The sun (big circle) and a planet (small circle) orbiting around their
CM in some solar system where the sun is not quite as massive as ours. The 
arrows give the instantaneous momenta at points 1 through 4. Our sun is so 
massive compared to most planets that the CM typically lies inside the body of 
the sun. 


Our earlier description with the fixed sun, while not strictly correct, 
is an excellent approximation to the truth for our solar system, given how 
much more massive our sun is than the planets (except for Jupiter). In the 
limit $m/M \to 0$, the CM falls right on top of the sun, and neither moves. This
was the limit we were tacitly assuming earlier. 

You have to be careful when you apply F = ma in the radial direction, 
for, say, the planet. You should write 


$$\frac{GMm}{r_{12}^2} = \frac{mv^2}{r_1} \tag{8.54}$$

where v is its velocity, $r_{12}$ is the distance between the sun and the planet,
and $r_1$ is the distance between the planet and the CM, the point about
which it goes in its circular orbit. In other words, the force of gravity on 
the planet or the sun is a function of the distance between the planet and 
the sun, not the distance to the CM, whereas the centripetal force depends 
on the radius of the orbit, which is the distance to the CM. 

Next, a few more examples of a CM coordinate R that is free from 
forces and immobile. 

Consider a closed railway carriage of length L and mass M that contains
a point horse of mass m. The horse is at the left end of the carriage,
which we choose to be x = 0. The CM of the carriage is at $x = \frac{L}{2}$. The initial
CM of the horse and carriage is at

$$X = \frac{m\cdot 0 + M\cdot\frac{L}{2}}{m+M}. \tag{8.55}$$


Assuming the track is frictionless, the CM cannot move. 

Now the horse decides to walk over to the right end of the carriage. 
You will know something is going on without looking in, because when the 
horse moves to the right, the carriage will move to the left, to keep the CM 
fixed. Let us find out by how much the midpoint of the carriage moves. 
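Say it goes to $x = \frac{L}{2} - \delta$. The horse then has a coordinate $\frac{L}{2} - \delta + \frac{L}{2}$,
because it is now $\frac{L}{2}$ to the right of the center of the carriage. Equating X
before and after, we find

$$\frac{m\cdot 0 + M\frac{L}{2}}{m+M} = \frac{m\,(L-\delta) + M\left(\frac{L}{2}-\delta\right)}{m+M} \tag{8.56}$$

$$M\frac{L}{2} = m\,(L-\delta) + M\left(\frac{L}{2}-\delta\right) \quad \text{with a solution} \tag{8.57}$$

$$\delta = \frac{mL}{m+M}. \tag{8.58}$$

Once again, if $m/M \to 0$, the carriage will not move when the horse does.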

Yet another problem is depicted in Figure 8.6. There is a boat of 
length L and mass M, whose left-most point is located d meters from the 
shore. You have a mass m, and you are x meters from the left end. You 
want to jump ship. Rather than leap from where you are, you decide to 
walk to the left-most part of the boat and then jump, because you would 
rather jump d meters than d + x meters. Again, assuming the water exerts 
no horizontal force on the boat, you will find you have to jump more than 
d. If you move to the left, the boat will move to the right, to keep the CM, 
shown by a big cross, at a fixed distance X from the shore. When you arrive 
at the left edge of the boat, you will be more than d meters from shore. Let 
us find out how far you have to jump. 

Figure 8.6 (Top) You (tiny circle of mass m) and the boat of mass M, before.
(Bottom) You at the left end of the boat, which has shifted slightly to the right
keeping the CM fixed at X.

Let D be the final distance from the shore to the left end of the boat,
which is also how much you need to jump. The CM of the boat has a
coordinate $D + \frac{L}{2}$. With the origin at the shore, let us equate the X before
and after, having first canceled the total mass from the denominator on both
sides:

$$m(d+x) + M\left(d+\frac{L}{2}\right) = mD + M\left(D+\frac{L}{2}\right) \tag{8.59}$$

$$D = \frac{m(d+x) + Md}{M+m} = d + \frac{mx}{m+M}, \tag{8.60}$$


which is more than d and less than d + x. Thus it helps to walk to the edge 
of the boat, but not by as much as you would naively expect. 
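
As a quick numerical check of Eqn. 8.60, here is a minimal Python sketch. The function names and the sample numbers are illustrative assumptions, not taken from the text; the only physics used is that the CM stays fixed when the water exerts no horizontal force.

```python
def jump_distance(m, M, d, x):
    """Final gap D between the shore and the boat's left end (Eqn. 8.60).

    m: your mass, M: boat mass, d: initial gap to the shore,
    x: your initial distance from the boat's left end.
    """
    return d + m * x / (m + M)


def cm_position(m, M, person, boat_left, L):
    """CM coordinate of person plus uniform boat, measured from the shore."""
    return (m * person + M * (boat_left + L / 2)) / (m + M)


# Hypothetical numbers: a 60 kg person, a 240 kg boat of length 4 m,
# boat initially 2 m from shore, person 3 m from the boat's left end.
m, M, L, d, x = 60.0, 240.0, 4.0, 2.0, 3.0
D = jump_distance(m, M, d, x)

X_before = cm_position(m, M, d + x, d, L)  # you at d + x, boat's left end at d
X_after = cm_position(m, M, D, D, L)       # you and the boat's left end both at D
print(D, X_before, X_after)
```

With these numbers the jump is longer than d but shorter than d + x, and the CM coordinate is the same before and after the walk.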

Let’s ask what happens when you leap to the shore and are airborne. 
The CM cannot move, so if you zoom to the left, the boat will move to 
the right. Equivalently, the total momentum cannot change. Originally, 
the momentum was 0; nobody was moving. Suddenly you're moving to the 
left, and the boat has to move the other way. Of course, its velocity is not 
the opposite of yours; its momentum is. So, the big mass of the boat times 
the small velocity of the boat will be equal and opposite to your small mass 
times your big velocity. 

Now you have landed on the shore. Your momentum is 0. What’s 
happening to the boat? Is it going to stop now? No, the boat will not stop 
just because you hit the shore. The boat will keep moving because there’s 
no force on it. The question you may have is, “How come there is suddenly
momentum in the system when it had none before?” The answer is that an 
external force has now come into play: the force of friction between you 
and the earth. Previously, it was just you and the boat, and you couldn’t 
change your total momentum. But the ground is now obviously pushing 
you to the right because you, who were initially flying to the left, have 
stopped. So the combined system (you and the boat) has a rightward
force acting on it during the time it took to stop you; it’s that momentum that’s
carried by the boat. A better way to say this is as follows. You and the boat
exchanged momenta: you push the boat to the right, you move to the left. 
Then your momentum is reduced to zero by the shore, and the earth as a 
whole takes up the missing momentum. The boat has no reason to change 
its momentum, and it keeps going. Can you calculate the speed of the boat? 
No. I only told you that you jumped and landed on the shore, and that’s 
not enough to predict how fast the boat will be moving. But if I told you 
your velocity when you leaped off the boat, then of course you know your 
momentum and can deduce that of the boat. 


8.4 Rocket science 


There is no use struggling through a physics course if at the end you cannot 
claim to do rocket science. So we are going to derive the rocket equation. 
Everyone knows that if you blow up a balloon and let it go, the balloon 
goes one way, and the air goes the other way. Action and reaction are equal; 
even lay people know that. I don’t want to go into the rocket problem in any 
great detail, just enough to get you familiarized with how the key equations 
are derived. 

Figure 8.7 part A shows a rocket whose mass at time t is M and whose 
velocity is v. The rocket emits gases, and the gases have a certain exhaust
velocity of magnitude $v_0$, pointing away from the rocket. The value of $v_0$
is fixed relative to the rocket, not relative to the ground. If you are riding
inside the rocket and you look at the fumes coming out of the back, they
will be leaving you at that speed $v_0$. Their velocity relative to the ground
will be $v - v_0$: the velocity $-v_0$ relative to the rocket plus the velocity v of
the rocket relative to the ground.

Figure 8.7 (A) The rocket at time t as seen from the ground. (B) The rocket at
time t + dt as seen from the ground. It has lost mass $\delta$ in the form of emitted
gases and gained speed dv. The fumes of mass $\delta$ (the blob) are moving at a speed
$(v - v_0)$ relative to the ground. (C) The rocket and fumes at time t + dt in the
frame moving at velocity v(t). The fumes have an exhaust velocity $-v_0$ in the
rocket frame.

A short time dt later, the rocket has a mass $M - \delta$ because it has lost
some of its own body mass $\delta$ in the form of exhaust fumes. The rocket’s
velocity is now $v + dv$. Let us balance the momentum before and after:

$$Mv = (M - \delta)(v + dv) + (v - v_0)\,\delta. \tag{8.61}$$


The left-hand side is clearly just the momentum of the rocket before this 
short interval. The right has first the new mass of the rocket times its new 
velocity. Next is the momentum of the blob of fumes emitted: its mass $\delta$
times its velocity relative to the ground of $v - v_0$. I write $\delta$ to the right of
$(v - v_0)$ so that when you open the brackets you do not make the mistake
of thinking $\delta v$ is a change in v. Let us open up the brackets and manipulate
as follows (keeping $\delta$ to the right):

$$Mv = Mv + M\,dv - v\cdot\delta - dv\cdot\delta + v\cdot\delta - v_0\cdot\delta \tag{8.62}$$

which simplifies to

$$v_0\,\delta = M\,dv \quad \text{or} \tag{8.63}$$

$$\frac{\delta}{M} = \frac{dv}{v_0}. \tag{8.64}$$
Along the way, I have dropped a quantity $dv\cdot\delta$ because it is quadratic in
the infinitesimals, and hence negligible compared to the infinitesimals dv
and $\delta$.

The next thing is to integrate these equations. The rocket emitted gas
of mass $\delta$ in time dt. The mass of the rocket then was reduced to $M - \delta$. In
the language of calculus, M(t) is the mass of the rocket at time t, and dM
is the change in the function M:

$$dM = M(t + dt) - M(t). \tag{8.65}$$

We have seen that

$$dM = (M - \delta) - M = -\delta, \tag{8.66}$$

so that Eqn. 8.64 becomes

$$-\frac{dM}{M} = \frac{dv}{v_0} \quad \text{which integrates to} \tag{8.67}$$

$$\ln\frac{M_0}{M} = \frac{v}{v_0} \tag{8.68}$$

$$v(t) = v_0 \ln\frac{M_0}{M(t)} \tag{8.69}$$

assuming the rocket had an initial velocity of 0 and mass $M_0$. Remember
that at large times, M(t) cannot fall below the mass of the empty rocket.
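
The passage from the momentum balance to the logarithm can be checked numerically. The following Python sketch is only an illustration under assumed numbers: it adds up the small gains $dv = v_0\,\delta/M$ of Eqn. 8.63 over many tiny ejections and compares the total with the closed form of Eqn. 8.69. The function names, the step count, and the sample values are hypothetical.

```python
import math


def rocket_speed(v0, M0, M):
    """Closed-form rocket speed, Eqn. 8.69: v = v0 * ln(M0 / M)."""
    return v0 * math.log(M0 / M)


def rocket_speed_stepwise(v0, M0, M, steps=100_000):
    """Accumulate dv = v0 * delta / mass (Eqn. 8.63) over many small ejections."""
    delta = (M0 - M) / steps  # mass of fumes ejected in each step
    mass, v = M0, 0.0
    for _ in range(steps):
        v += v0 * delta / mass  # speed gained while the rocket has this mass
        mass -= delta           # the rocket is now lighter by delta
    return v


# Hypothetical values: exhaust speed 2500 m/s, mass falling from 1000 kg to 250 kg.
v0, M0, M = 2500.0, 1000.0, 250.0
exact = rocket_speed(v0, M0, M)
approx = rocket_speed_stepwise(v0, M0, M)
print(exact, approx)
```

The two numbers agree more closely as the number of steps grows, which is the numerical counterpart of dropping the quadratic term $dv\cdot\delta$.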


8.5 Elastic and inelastic collisions 


A body of mass $m_1$ and velocity $v_1$ collides with a body of mass $m_2$ and
velocity $v_2$, with all velocities shown as positive in Figure 8.8. Our goal is
to find the final velocities $v_1'$ and $v_2'$. We need two conditions to find two
unknowns, right? The conservation of momentum is always good for one
equation as long as there are no external forces like friction:

$$m_1v_1 + m_2v_2 = m_1v_1' + m_2v_2'. \tag{8.70}$$


You need a second equation to solve for the two unknowns, and that’s 
where there are two extreme cases. 
Figure 8.8 (Top) A body of mass $m_1$ and velocity $v_1$ collides with a body of
mass $m_2$ and velocity $v_2$, with all velocities shown as positive. (Bottom) The two
bodies with final velocities $v_1'$ and $v_2'$.


One is called the totally inelastic collision in which the two masses 
stick together and move at a common velocity $v_1' = v_2' = v'$. That means
there is just a single unknown $v'$, which we find by going back to Eqn. 8.70

$$m_1v_1 + m_2v_2 = (m_1 + m_2)\,v' \quad \text{with a solution} \tag{8.71}$$

$$v' = \frac{m_1v_1 + m_2v_2}{m_1 + m_2}. \tag{8.72}$$


The other kind of collision is called totally elastic. Here the kinetic 
energy is conserved: 


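$$\frac{1}{2}m_1v_1^2 + \frac{1}{2}m_2v_2^2 = \frac{1}{2}m_1(v_1')^2 + \frac{1}{2}m_2(v_2')^2. \tag{8.73}$$

You can juggle Eqns. 8.70 and 8.73 and solve for $v_1'$ and $v_2'$. The
answer is

$$v_1' = \left[\frac{m_1 - m_2}{m_1 + m_2}\right] v_1 + \left[\frac{2m_2}{m_1 + m_2}\right] v_2 \tag{8.74}$$

$$v_2' = \left[\frac{m_2 - m_1}{m_1 + m_2}\right] v_2 + \left[\frac{2m_1}{m_1 + m_2}\right] v_1. \tag{8.75}$$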


Because one of these two simultaneous equations (Eqn. 8.73) is quadratic 
in the velocities and will not yield to the familiar trick for linear ones, here 
is some help for those who want to derive Eqns. 8.74 and 8.75. Rewrite
Eqn. 8.73 as

$$\frac{1}{2}m_1v_1^2 - \frac{1}{2}m_1(v_1')^2 = \frac{1}{2}m_2(v_2')^2 - \frac{1}{2}m_2v_2^2 \tag{8.76}$$

$$m_1(v_1 - v_1')(v_1 + v_1') = m_2(v_2' - v_2)(v_2' + v_2) \tag{8.77}$$

while Eqn. 8.70 says

$$m_1(v_1 - v_1') = m_2(v_2' - v_2). \tag{8.78}$$

Upon dividing Eqn. 8.77 by this we get

$$(v_1 + v_1') = (v_2 + v_2'). \tag{8.79}$$

The last two linear equations 8.78 (which is just a rearrangement of
Eqn. 8.70) and 8.79 may be readily solved for $v_1'$ and $v_2'$ to yield Eqns. 8.74
and 8.75.
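
For readers who prefer to check the algebra numerically, here is a small Python sketch of Eqns. 8.72, 8.74, and 8.75. The function names and the sample masses and velocities are illustrative assumptions; the printed totals let you confirm that the elastic formulas conserve both momentum and kinetic energy, and that they satisfy Eqn. 8.79.

```python
def elastic_collision(m1, v1, m2, v2):
    """Final velocities in a one-dimensional totally elastic collision
    (Eqns. 8.74 and 8.75)."""
    total = m1 + m2
    v1_final = (m1 - m2) / total * v1 + 2 * m2 / total * v2
    v2_final = (m2 - m1) / total * v2 + 2 * m1 / total * v1
    return v1_final, v2_final


def inelastic_collision(m1, v1, m2, v2):
    """Common final velocity in a totally inelastic collision (Eqn. 8.72)."""
    return (m1 * v1 + m2 * v2) / (m1 + m2)


def momentum(m1, v1, m2, v2):
    return m1 * v1 + m2 * v2


def kinetic_energy(m1, v1, m2, v2):
    return 0.5 * m1 * v1**2 + 0.5 * m2 * v2**2


# Hypothetical data: a 2 kg body at +3 m/s meets a 1 kg body at -1 m/s.
m1, v1, m2, v2 = 2.0, 3.0, 1.0, -1.0
v1f, v2f = elastic_collision(m1, v1, m2, v2)
v_stuck = inelastic_collision(m1, v1, m2, v2)

print(momentum(m1, v1, m2, v2), momentum(m1, v1f, m2, v2f))
print(kinetic_energy(m1, v1, m2, v2), kinetic_energy(m1, v1f, m2, v2f))
print(kinetic_energy(m1, v_stuck, m2, v_stuck))  # smaller: energy is lost when they stick
```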

You have to be very careful in using the conservation laws. You cannot
use the law of conservation of mechanical energy in an inelastic collision. For example,
if two identical bodies with opposite velocities collide and stick together to 
form a single mass at rest, they have lost their initial kinetic energy. 

Here is an example that illustrates the proper use of conservation
laws. You have a pistol and you want to know with what speed the bullet
comes out. In the old days, the following technique was used. Figure 8.9
shows a ballistic pendulum. You hang a chunk of wood of mass M from
the ceiling. Then you fire the bullet of known mass m and unknown speed
$v_0$. The bullet rams into this chunk of wood, and the whole thing is set in
motion. The combination is like a pendulum, which will rise up to a certain
maximum height h that you can easily measure. From that maximum
height, you can calculate the speed of the bullet.

Figure 8.9 A bullet of mass m and velocity $v_0$ plows into a suspended block of
wood of mass M, and the two of them rise as a pendulum to height h.

You might be naive and decide not to care about intermediate details,
and find $v_0$ by equating the initial kinetic energy of the bullet $\frac{1}{2}mv_0^2$ to the
final potential energy $(M + m)gh$ of the block and bullet. That is

$$(M + m)gh = \frac{1}{2}mv_0^2 \tag{8.80}$$


where the only unknown is $v_0$. That’s wrong because you cannot use the
law of conservation of mechanical energy in the totally inelastic collision between the
bullet and the block. Some energy will go into heating up the block. But
you can use the law of conservation of horizontal momentum during the
collision, since gravity cannot change the total horizontal momentum during
the practically instantaneous collision. So in the first totally inelastic
collision you may assert that

$$mv_0 = (M + m)V, \tag{8.81}$$

which determines the velocity V with which the block and bullet begin to
swing as a pendulum. The pendulum then climbs to a height h with no
further loss of energy, allowing you to write

$$\frac{1}{2}(M + m)V^2 = (M + m)gh. \tag{8.82}$$
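Combining Eqns. 8.81 and 8.82 we find

$$\begin{aligned}
(M + m)gh &= \frac{1}{2}(M + m)V^2 && \text{(8.83)} \\
&= \frac{1}{2}(M + m)\left[\frac{mv_0}{M + m}\right]^2 = \frac{m^2v_0^2}{2(M + m)}. && \text{(8.84)}
\end{aligned}$$

The initial or muzzle velocity of the bullet is then

$$v_0 = \left[1 + \frac{M}{m}\right]\sqrt{2gh}. \tag{8.85}$$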
Notice that after the initial inelastic collision the energy of the block
and bullet is

$$\begin{aligned}
\frac{m^2v_0^2}{2(M + m)} &= \frac{1}{2}mv_0^2\,\frac{m}{m + M} && \text{(8.86)} \\
&< \frac{1}{2}mv_0^2 && \text{(8.87)}
\end{aligned}$$


which reflects the energy loss in the inelastic collision. 
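
To see how far off the naive energy argument is, here is a short Python sketch comparing Eqn. 8.85 with the incorrect Eqn. 8.80. The function names, the value g = 9.8 m/s², and the sample masses and height are assumptions made for illustration only.

```python
import math

G = 9.8  # m/s^2, assumed value of the acceleration due to gravity


def muzzle_speed(m, M, h, g=G):
    """Bullet speed from the ballistic pendulum, Eqn. 8.85."""
    return (1 + M / m) * math.sqrt(2 * g * h)


def naive_muzzle_speed(m, M, h, g=G):
    """The wrong answer obtained from Eqn. 8.80, which ignores the energy
    lost in the inelastic collision."""
    return math.sqrt(2 * (M + m) * g * h / m)


def energy_fraction_retained(m, M):
    """Fraction of the bullet's kinetic energy left after it embeds in the
    block, read off from Eqn. 8.86."""
    return m / (m + M)


# Hypothetical numbers: a 10 g bullet, a 2 kg block, a 5 cm rise.
m, M, h = 0.010, 2.0, 0.05
v0 = muzzle_speed(m, M, h)
v0_naive = naive_muzzle_speed(m, M, h)
print(v0, v0_naive, energy_fraction_retained(m, M))
```

For a light bullet and a heavy block almost all of the initial kinetic energy is lost in the collision, so the naive estimate falls far below the correct muzzle speed.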


8.6 Scattering in higher dimensions 


The problems we did on scattering explain how the basic conservation laws 
of energy and momentum are to be applied. But they fail to indicate the 
great importance of scattering experiments in our understanding of funda- 
mental physics. For example, we learned only from scattering that the atom 
has a hard nucleus. It was known, in the early twentieth century, that the 
negative charge of the point-like electrons was balanced by a compensating 
positive charge. In one popular model, called the “plum-pudding model,” 
the atom was a positive spherical charge, with the electrons embedded in 
it. The scattering experiments of Ernest Rutherford changed everything. 
By shooting a beam of alpha particles (helium nuclei with two protons 
and two neutrons) at a gold foil and detecting the scattered particles in 
various directions, he deduced the structure of the nucleus. In particu- 
lar, he was stunned to notice that some of the alpha particles came right 
back, turned around by 180°. No soft mush of positive charge (as in the 
plum-pudding model) could have done that. He then assumed the atomic 
nucleus was hard and point-like, contained all the mass and charge, and 
exerted a $1/r^2$ electric repulsion on the alpha particles. He computed what
fraction of incident particles would suffer a deflection by an angle $\theta$ from
the incident direction. The scattering data were in perfect agreement with
his prediction and provided a brilliant confirmation of the point-nucleus,
the cornerstone of Bohr’s model of the atom. (Luckily for Rutherford, his
treatment of the scattering as a Kepler problem with a repulsive $1/r^2$ force
gave answers that coincided with that of the quantum treatment, which 
came many years later.) 

During the years 1966 to 1978 Jerome Friedman, Henry Kendall, 
and Richard Taylor, of the Stanford Linear Accelerator Center, shot high- 
energy electrons at nucleons (a common name for protons and neutrons). 
The incident electron emitted photons of extremely short wavelength, and
these were exquisite probes of nucleon structure at very short length-scales.
They revealed that the proton was not a point charge but was made up of
two “up” quarks, each of which carried a fraction $\frac{2}{3}$ of the proton charge, and one
“down” quark, which carried $-\frac{1}{3}$ of the proton charge. Likewise, the neutron
was made up of two down quarks and one up quark. These charges
were in agreement with the assignments made by the inventors of the 
quark model, Murray Gell-Mann and George Zweig. (Besides the three 
quarks, the nucleons also contain a cloud of short-lived quark-antiquark 
pairs and “gluons,” which bind the quarks. Scattering can also probe this 
cloud.) 


In three dimensions, the law of conservation of momentum for a 
two-particle collision takes the form of a vector equation 


$$\mathbf{p}_1 + \mathbf{p}_2 = \mathbf{p}_1' + \mathbf{p}_2'. \tag{8.88}$$


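If the collision is totally inelastic, the final blob will have mass $m_1 + m_2$ and
move at a velocity $\mathbf{V}$ that satisfies momentum conservation:

$$\mathbf{V} = \frac{\mathbf{p}_1 + \mathbf{p}_2}{m_1 + m_2}. \tag{8.89}$$

If the collision is elastic we may assume exactly as in d = 1,

$$\frac{1}{2}m_1v_1^2 + \frac{1}{2}m_2v_2^2 = \frac{1}{2}m_1(v_1')^2 + \frac{1}{2}m_2(v_2')^2, \tag{8.90}$$

which may be rewritten in terms of momenta as

$$\frac{p_1^2}{2m_1} + \frac{p_2^2}{2m_2} = \frac{(p_1')^2}{2m_1} + \frac{(p_2')^2}{2m_2} \tag{8.91}$$

since

$$\frac{p^2}{2m} = \frac{m^2v^2}{2m} = \frac{1}{2}mv^2. \tag{8.92}$$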


In d = 2, Eqns. 8.88 and 8.90 furnish three equations, not enough to
determine the four numbers $\mathbf{p}_1'$ and $\mathbf{p}_2'$. This is because, unlike in d = 1,
a collision is not fully specified by the incoming momenta. We need to
know the impact parameter b shown in Figure 8.10, and the nature of the
force between them (e.g., $1/r^2$) if they are point-like, or their radii, if they
are impenetrable spheres that exert a force on each other upon contact.

Figure 8.10 An alpha particle approaching a gold nucleus with impact
parameter b. If you assume the nucleus is immobile and exerts a Coulomb force
$1/r^2$, you can calculate the direction $\theta$ in which the alpha particle will emerge, as
a function of b and the energy. If a beam of particles with a distribution of b’s is
given, one can predict the particle flux in any direction $\theta$.

There is one problem where we can make a definite prediction without
knowing b. Let $m_1 = m_2 = m$, and let us assume initially $\mathbf{p}_2 = 0$. Upon
equating the lengths squared of both sides of

$$\mathbf{p}_1 + \mathbf{p}_2 = \mathbf{p}_1' + \mathbf{p}_2' \tag{8.93}$$

we find for this case when $\mathbf{p}_2 = 0$,

$$(\mathbf{p}_1 + \mathbf{p}_2)\cdot(\mathbf{p}_1 + \mathbf{p}_2) = (\mathbf{p}_1' + \mathbf{p}_2')\cdot(\mathbf{p}_1' + \mathbf{p}_2') \tag{8.94}$$

$$p_1^2 = (p_1')^2 + (p_2')^2 + 2\,\mathbf{p}_1'\cdot\mathbf{p}_2' \tag{8.95}$$

while Eqn. 8.91 says

$$p_1^2 = (p_1')^2 + (p_2')^2 \quad \text{which means} \tag{8.96}$$

$$\mathbf{p}_1'\cdot\mathbf{p}_2' = 0. \tag{8.97}$$

Because $\mathbf{p}_1'\cdot\mathbf{p}_2' = p_1'p_2'\cos\theta_{12}$ this means either the angle $\theta_{12}$ between the
final particles will be 90° or $p_1' = 0$, in which case the outgoing particle 2
and the incoming particle 1 have exchanged momenta, clearly conserving
energy and momentum.

Observe that we do not have enough information to figure out the
individual angles, just the relative angle $\theta_{12}$.
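
The right-angle result can be illustrated with a short Python sketch. To generate outcomes that conserve both momentum and kinetic energy, the sketch rotates the momenta in the CM frame by an arbitrary angle `phi`; this construction and all the names in it are assumptions added for illustration and are not part of the derivation above. The angle `phi` stands in for the information (impact parameter and force law) that the conservation laws alone do not supply.

```python
import math


def scatter_equal_masses(p1, phi):
    """Outgoing momenta when a particle of momentum p1 = (px, py) hits an
    equal mass at rest, for a CM-frame scattering angle phi (a free input)."""
    px, py = p1
    half = (px / 2, py / 2)      # share of momentum carried by the CM motion
    q = math.hypot(px, py) / 2   # magnitude of each momentum in the CM frame
    base = math.atan2(py, px)    # direction of the incoming particle
    qx = q * math.cos(base + phi)
    qy = q * math.sin(base + phi)
    p1_out = (half[0] + qx, half[1] + qy)
    p2_out = (half[0] - qx, half[1] - qy)
    return p1_out, p2_out


def dot(a, b):
    """Dot product of two vectors in the plane."""
    return a[0] * b[0] + a[1] * b[1]


# Hypothetical incoming momentum (3, 4) in arbitrary units.
for phi in (0.3, 1.0, 2.5, math.pi):
    a, b = scatter_equal_masses((3.0, 4.0), phi)
    print(phi, dot(a, b))  # the dot product vanishes up to rounding
```

The case `phi = math.pi` corresponds to $p_1' = 0$: the incoming particle stops and the target carries away all the momentum.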


CHAPTER 9 


Rotational Dynamics I 


9.1 Introduction to rigid bodies 


In this chapter we graduate to objects like potatoes that are not point-like. 
For such extended objects, it is not simple to say where “it” is. We can pick 
a point on the object, like the CM, and locate it, but still we do not have 
the whole story. We need to say which way the potato is pointing, an issue 
we did not have with point particles. We could go all the way and consider 
a body like a snake, which is not only extended but also capable of chang- 
ing its shape. That is too hard, so I will focus on extended but rigid bodies, 
whose shape is fixed—like a dead snake. By definition, if you take any two 
points on a rigid body, the distance between them will not change with 
time. No body is exactly rigid, though Al Gore during debates comes close. 

The dynamics of rigid bodies in three dimensions is fairly complicated,
so we will begin with two dimensions. Starting with one
dimension will not work here, because no rotations are possible
in d = 1.

Consider a planar object—say a thin sheet of metal cut into some 
shape—that moves in the plane of the page, as shown in Figure 9.1. As you 
can see, it does something fairly complicated. Regardless of how it actually 
got from state 1 to state 2, we can attain the final state 2 from the initial 
state 1 by (i) moving the body until any one point F, which could be, but 
need not be, the CM, has reached its final location and (ii) following this 
with a rotation around an axis passing through F. The translation brings 
the body to the dotted configuration. The rotation axis passes through F, 
so as to attain the final state 2 without disturbing F, already in its final 
position. There is a nonobvious generalization to three dimensions: we
can go from the initial to the final configuration of a rigid body by a trans- 
lation, followed by a rotation by a suitable angle, about a suitable axis. Let 
us return to the planar body d = 2 for now. 

To specify it completely at any time, I have to tell you the coordinates 
of F and the rotation. To define the rotation, we draw an imaginary line 
on the body from F to any other fixed point, say the pointy edge, as shown 
in the inset of Figure 9.1. Then we say what angle $\theta$ this line makes with a
fixed direction, usually the x-axis. If I give you the location of F and $\theta$, you
can reconstruct the configuration of the body shown in Figure 9.1. 

Translations are what we have been studying all the time. We are ex- 
perts on that subject. So we want to focus initially on a body that is only 
rotating but not translating. Once we have sharpened our skills, we will 
introduce the translations. To this end, imagine that I have driven a nail 
through one point F. The nail keeps it from translating but not from rotating
about the nail axis. This rotation is fully specified by $\theta$, which is the
rotational analog of x.

We are going to set up an analogy between one-dimensional translation
described by x(t) and two-dimensional rotations described by $\theta(t)$.


Figure 9.1 A rigid planar body undergoing translation and rotation in a plane 
as it goes from state 1 to state 2. The dot represents the fixed point F. The solid 
line from the F to the pointy edge of the object allows us to follow its rotation. 
In state 2 the original orientation is shown in dotted lines and the curved arrow 
shows the rotation $\theta$ needed to bring it to the final orientation.
You'll find the analogy very helpful. The number of things you have to remember
will be reduced if you learn to map the problem of rotations to
the problem of translation in one dimension.


9.2 Angle of rotation, the radian 


Even though the body is in two dimensions, you need just this one angle
$\theta$ to specify its orientation. How do we want to measure $\theta$? The standard
preference in daily life is to measure it in degrees. When the body completes
one full revolution, we say it has turned by an angle of 360°. We’re
going to use something else: radians. Why would anybody think of a radian?
What’s wrong with degrees? All the novels say, “I was going down
the wrong path and then I did a 180.”

Here is how a radian arises and why we like it. Consider a point on the
body, at a distance r from the center at angle $\theta$ from the x-axis as shown
in Figure 9.2. You agree that if the rigid body is rotating, this point will
be traversing a circle of radius r. How long is the arc s that it traces out
from the time it was at $\theta = 0$? Here is one way to calculate it. If the point
traversed a full circle, we know $s = 2\pi r$. If it has rotated by $\theta^\circ$, the arc
length, by simple proportionality, is

$$s = \frac{\theta^\circ}{360}\,(2\pi r). \tag{9.1}$$


Figure 9.2 As the rigid body rotates around some fixed point F, a typical point 
P at a distance r from F moves on a circle of radius r. 
Someone then decided to make life easy by calling $\theta^\circ$ in degrees times
$\frac{2\pi}{360}$ the same angle measured in radians. The angle in radians will simply
be denoted by $\theta$ with no superscript. That is

$$\theta = \frac{2\pi\,\theta^\circ}{360}. \tag{9.2}$$

In terms of $\theta$ you find a very neat formula

$$s = r\theta. \tag{9.3}$$

The linear distance traveled, s, is simply the angle traversed in radians
times the distance r to the axis of rotation. That a full circle is worth
$2\pi$ radians follows from Eqn. 9.2. Equation 9.3 agrees with the familiar
expression $C = 2\pi r$ for the circumference.

If 360° = $2\pi$ radians, a radian is roughly 57.3°. It looks like an odd
thing to pick, but it’s odd only if you start with 360 degrees. People from
an alien culture might not use 360 at all. On the other hand, I believe $2\pi$
would be discovered in any advanced civilization.

You have to know a few popular angles in radians. For instance, a
quarter circle is 90° = $\frac{\pi}{2}$, while 60° = $\frac{\pi}{3}$, 45° = $\frac{\pi}{4}$, and 30° = $\frac{\pi}{6}$.
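
If you want to check such conversions yourself, here is a minimal Python sketch of Eqns. 9.2 and 9.3. The function names are illustrative assumptions; Python's standard `math.radians` performs the same conversion and is used here only as a cross-check.

```python
import math


def degrees_to_radians(theta_deg):
    """Eqn. 9.2: the angle in radians is 2*pi/360 times the angle in degrees."""
    return 2 * math.pi * theta_deg / 360


def radians_to_degrees(theta):
    """Inverse of Eqn. 9.2."""
    return 360 * theta / (2 * math.pi)


def arc_length(r, theta):
    """Eqn. 9.3: s = r * theta, with theta in radians."""
    return r * theta


for deg in (90, 60, 45, 30):
    print(deg, degrees_to_radians(deg), math.radians(deg))

print(radians_to_degrees(1.0))                      # one radian expressed in degrees
print(arc_length(2.0, degrees_to_radians(360.0)))   # circumference of a circle of radius 2
```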


If we take the time derivative of Eqn. 9.3, $s = r\theta$, we find

$$\frac{ds}{dt} = r\frac{d\theta}{dt} \tag{9.4}$$

because r is a constant by the rigid body condition. Clearly $\frac{ds}{dt}$ is the actual
tangential speed $v_T$, or the magnitude of the velocity in the tangential
direction. If at this instant this point in the body separated and flew off, it
would emerge with that speed tangent to the circle. Thus

$$v_T = r\omega \quad \text{where} \tag{9.5}$$

$$\omega = \frac{d\theta}{dt} \tag{9.6}$$

is called the angular velocity and is measured in radians per second. It is
positive for counterclockwise rotation.
What is the angular velocity of the moon as it goes around the earth?
In roughly 28 days it covers $2\pi$ radians so

$$\omega = \frac{2\pi}{28 \times 24 \times 60 \times 60} \simeq 2.6\cdot 10^{-6}\ \text{rad/s}. \tag{9.7}$$

Remember, the rigid body as a whole has a single angular velocity,
but the tangential velocity of a point a distance r from the axis of rotation
varies as $\omega r$.
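
The arithmetic of Eqn. 9.7 is easy to reproduce. The Python sketch below computes the moon's angular velocity and, using Eqn. 9.5, its tangential speed. The earth-moon distance of about $3.84\cdot 10^8$ m is an outside value assumed for illustration and does not appear in the text; the function names are hypothetical as well.

```python
import math


def angular_velocity(period_seconds):
    """Angular velocity for one full revolution (2*pi radians) per period."""
    return 2 * math.pi / period_seconds


def tangential_speed(omega, r):
    """Eqn. 9.5: v_T = r * omega."""
    return r * omega


period = 28 * 24 * 60 * 60        # roughly 28 days, in seconds
omega_moon = angular_velocity(period)

EARTH_MOON_DISTANCE = 3.84e8      # meters; assumed value, not from the text
v_moon = tangential_speed(omega_moon, EARTH_MOON_DISTANCE)
print(omega_moon, v_moon)
```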


9.3 Rotation at constant angular acceleration 


Let’s take a problem where the angular velocity $\omega$ is itself changing. We
define the angular acceleration $\alpha$ by

$$\alpha = \frac{d\omega}{dt} = \frac{d^2\theta}{dt^2}, \tag{9.8}$$

measured in rad·s$^{-2}$. Consider a rotating circular saw where its $\omega$ itself is
changing. Any point on it, at a distance r from the center, has two components
of the usual (linear) acceleration. First, even if $\omega$ is a constant, it has
the centripetal acceleration directed toward the center:

$$a_r = \frac{v_T^2}{r} = \frac{\omega^2r^2}{r} = \omega^2 r. \tag{9.9}$$

If in addition $\omega$ itself is changing, the tangential velocity $v_T = \omega r$ will also
change, and there will be a tangential acceleration

$$a_T = \frac{dv_T}{dt} = r\frac{d\omega}{dt} = r\alpha. \tag{9.10}$$

If you are driving around a circular racetrack in a car, a non-zero $a_T$ will
result when you step on the gas and see the speedometer needle move up.
In addition, even at constant speed, the seat belt will remind you of the
centripetal acceleration $a_r$.
In summary here are the relations between angular and tangential
quantities:

$$s = r\theta \tag{9.11}$$

$$v_T = r\omega \tag{9.12}$$

$$a_T = r\alpha. \tag{9.13}$$


The bottom two equations follow upon differentiating the top one, bearing 
in mind that in a rigid body r, the distance of any point from the axis of 
rotation, will not change with time. 

Suppose you are standing on a rotating platform that is undergoing 
angular acceleration, and you want to know what frictional force you need 
to stay fixed on the platform. The answer is 


$$F_T = ma_T = mr\alpha \tag{9.14}$$

$$F_r = -ma_r = -m\omega^2 r \tag{9.15}$$


in the tangential and radial directions. 

We can borrow many results from linear motion described by $x$ by 
making the obvious substitutions. For a body that has a constant angular 
acceleration $\alpha$, in obvious notation, 

$$\theta = \theta_0 + \omega_0 t + \frac{1}{2}\alpha t^2 \tag{9.16}$$
$$\omega = \omega_0 + \alpha t \tag{9.17}$$
$$\omega^2 = \omega_0^2 + 2\alpha(\theta - \theta_0). \tag{9.18}$$

Suppose a chain-saw blade spinning at $\omega_0$ is brought to rest after $n$ 
revolutions. What is $\alpha$, assuming it was constant? We simply set $\omega = 0$ and 
$\theta - \theta_0 = 2\pi n$ radians in Eqn. 9.18 and solve for $\alpha$, which gives 
$\alpha = -\omega_0^2/(4\pi n)$. 
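Here is a minimal sketch of the chain-saw calculation. The numbers (a blade at 100 rad/s stopping in 20 revolutions) and the function names are made up for illustration. The last lines feed the result back into Eqn. 9.16 as a consistency check.

```python
import math

def stopping_angular_acceleration(omega0, n_rev):
    """Constant alpha that brings omega0 to rest in n_rev revolutions
    (Eqn. 9.18 with omega = 0 and theta - theta0 = 2*pi*n_rev)."""
    delta_theta = 2 * math.pi * n_rev
    return -omega0**2 / (2 * delta_theta)

def stopping_time(omega0, alpha):
    """Time to stop, from omega = omega0 + alpha * t = 0 (Eqn. 9.17)."""
    return -omega0 / alpha

# Hypothetical numbers: a 100 rad/s blade that stops in 20 revolutions.
omega0, n_rev = 100.0, 20
alpha = stopping_angular_acceleration(omega0, n_rev)
t_stop = stopping_time(omega0, alpha)

# Consistency check with Eqn. 9.16: the angle turned while stopping.
angle_turned = omega0 * t_stop + 0.5 * alpha * t_stop**2
print(alpha, t_stop, angle_turned / (2 * math.pi))
```

The negative sign of `alpha` simply says the blade is slowing down.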


9.4 Rotational inertia, momentum, and energy 


Now let’s find the kinetic energy of a rotating rigid body. It has a mass, and 
if it’s spinning, all the little particles making up the body are moving, and 
they have their own $K = \frac{1}{2}mv^2$. We want to compute the total $K$ summed 
over all the particles. 


Figure 9.3 A simple rigid body made of three point masses attached to a fixed 
hub by three massless rigid rods. When a force $F_1$ acts on 1, only the tangential 
component $F_{T1}$ produces angular acceleration. 


For that purpose, it’s convenient to begin with the 
following simpler rigid body, shown in Figure 9.3. It is made up of masses 
$m_1$, $m_2$, and $m_3$ attached by rigid massless rods of length $r_1$, $r_2$, and $r_3$ 
to a fixed massless point-like hub about which the body can rotate. Say 
it is rotating with a common angular velocity $\omega$. The kinetic energy of 
this object is the kinetic energy of each mass summed over all the masses. 
Now, what’s the velocity of each mass? If you consider $m_1$, its velocity is 
necessarily perpendicular to the line joining it to the point of rotation. 
It has no radial velocity because $r_1$ cannot change. The magnitude of its 
velocity, entirely tangential, is $v_1 = \omega r_1$. In general for mass $i$ 

$$v_i = \omega r_i \tag{9.19}$$

and the kinetic energy of all of them is 

$$K = \frac{1}{2}m_1 v_1^2 + \frac{1}{2}m_2 v_2^2 + \frac{1}{2}m_3 v_3^2 \tag{9.20}$$
$$= \sum_{i=1}^{3} \frac{1}{2} m_i v_i^2 \tag{9.21}$$
$$= \sum_{i=1}^{3} \frac{1}{2} m_i \omega^2 r_i^2. \tag{9.22}$$

In my example, the summation goes from $i = 1$ to $i = 3$; you can 
make up a body where the sum goes from 1 to 30,000. Often I will not 
indicate the range of values of i in the sum. 

Notice that in Eqn. 9.22, unlike $m_i$ and $r_i$, $\omega$ does not have a subscript 
$i$ because it is the same for all parts of the rigid body. So, you can pull it 
out of the sum and write $K$ as follows: 

$$K = \frac{1}{2}\omega^2 \sum_{i=1}^{3} m_i r_i^2 \tag{9.23}$$
$$= \frac{1}{2} I \omega^2 \quad \text{where} \tag{9.24}$$
$$I = \sum_{i=1}^{3} m_i r_i^2 \tag{9.25}$$

is called the moment of inertia. 

The moment of inertia is determined not only by the masses that 
make up the body but also by how far they are from the axis of rotation. If 
all the masses just fell on top of the axis, the body would have no moment 
of inertia, though it would have a mass. Likewise, if the same mass is spread 
out more from the axis of rotation, the moment of inertia will be more. It 
requires a calculation to find the moment of inertia. 

If someone says, “Here are the masses; please find me the moment of 
inertia,” you should say, “I cannot do it until you tell me the point around 
which you plan to rotate the body.” The moment of inertia is defined with 
respect to a point. The mass is just the mass, but the moment of inertia 
depends on the point with respect to which you're computing it. For ex- 
ample, in Figure 9.3 we are assuming the rotation is around the hub. If, 
however, it is around some other point, we would need the distances to 
that point to find the relevant moment of inertia. 
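To see how much the answer depends on the chosen point, here is a small sketch that evaluates $I = \sum_i m_i r_i^2$ for the same three masses about two different points. The masses, the positions, and the second axis location are invented for illustration and are not taken from Figure 9.3.

```python
def moment_of_inertia(masses, positions, axis_point=(0.0, 0.0)):
    """Sum of m_i * r_i^2, with r_i measured from axis_point in the plane."""
    ax, ay = axis_point
    return sum(m * ((x - ax) ** 2 + (y - ay) ** 2)
               for m, (x, y) in zip(masses, positions))

def rotational_kinetic_energy(I, omega):
    """K = (1/2) I omega^2 for a rigid body turning at omega about the axis."""
    return 0.5 * I * omega**2

# Hypothetical three-mass body: masses in kg, positions in m, hub at the origin.
masses = [1.0, 2.0, 3.0]
positions = [(1.0, 0.0), (0.0, 2.0), (-1.0, -1.0)]

I_hub = moment_of_inertia(masses, positions)                            # about the hub
I_other = moment_of_inertia(masses, positions, axis_point=(1.0, 0.0))   # about mass 1
print(I_hub, I_other, rotational_kinetic_energy(I_hub, 2.0))
```

Same masses, same body, yet the two moments of inertia differ, which is why the axis must be specified first.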

When we compare $K = \frac{1}{2}mv^2$ to $K = \frac{1}{2}I\omega^2$, we see that in the world of 
rotations, $\omega$ plays the role of $v$ and $I$ plays the role of $m$. It is then natural 
to construct the rotational analog of the momentum $p = mv$, called the 
angular momentum: 

$$L = I\omega. \tag{9.26}$$

Next we seek the rotational analog of $F = ma = \frac{dp}{dt}$. That’s going to be 
equal to some mystery object, which we will call the torque $\tau$. What does it 
equal? 

$$\tau = \frac{dL}{dt} \tag{9.27}$$
$$= \frac{d[I\omega]}{dt} = I\frac{d\omega}{dt} \tag{9.28}$$
$$= I\alpha \tag{9.29}$$


where we can pull the $I$ out of the derivative because it is not changing: 
even as the body rotates and the masses move, $r_i$, the distance of mass $i$ 
from the axis of rotation, does not change. We will now relate the torque 
to the external forces acting on the masses by the following manipulations: 

$$\tau = I\alpha \tag{9.30}$$
$$= \sum_i m_i r_i^2 \alpha \tag{9.31}$$
$$= \sum_i m_i r_i \alpha \cdot r_i \tag{9.32}$$
$$= \sum_i m_i a_{Ti}\, r_i \tag{9.33}$$
$$= \sum_i F_{Ti}\, r_i \tag{9.34}$$

where in arriving at the last equation I have used the fact that mass $m_i$ 
times the tangential acceleration $a_{Ti} = r_i\alpha$ is $F_{Ti}$, the tangential 
component of the force on mass $i$. Note that the tangential component of 
the force is positive if it is in the direction of increasing angle, that is, 
counterclockwise. 

In summary, the torque is a sum of torques on each mass, and the 
torque on each mass is the product of the tangential component of the 
external force acting on it and the distance $r_i$ to the axis of rotation. (I say 
“external” because there are internal forces that keep the body rigid and 
that do not figure in this.) Figure 9.3 shows an example with just one force 
acting on mass 1. 

We will follow the convention that a torque is positive if it will cause 
positive acceleration, that is, in the direction of increasing $\theta$. 

Here are two illustrations, shown in Figure 9.4. The left half shows 
a merry-go-round that you want to accelerate. You should apply a force 
tangent to the wheel for maximum effect. If you apply a force in a general 
direction as shown by the bold arrow, the tangential part will contribute 
to $\tau$ and produce some $\alpha$, while the radial part and the internal forces will 
keep the body rigid and furnish the requisite centripetal force. 

Figure 9.4 (Left) A merry-go-round being pushed by a force $F$. Only the 
tangential part $F\sin\gamma$ produces torque and angular acceleration. The angle $\gamma$ is 
measured from (the continuation of) $r$ to $F$. If we reverse the direction of $F$ in the 
figure, $\gamma$ will exceed $\pi$ and the torque will become negative and lead to clockwise 
acceleration. The radial part $F\cos\gamma$ is canceled by internal forces in the rigid 
body. (Right) A view from above of a door on hinges in the early days, with 
two possible placements of knobs. The figure shows a poorly placed doorknob 
numbered 1 and applied force (dotted line), and a better choice (number 2 and 
solid line). We can read $\tau = Fr\sin\gamma$ in two equivalent ways: $r$ times $F\sin\gamma$, the 
component of the force perpendicular to the separation from the axis $r$, or $F$ 
times $r\sin\gamma$, the component of $r$ perpendicular to the line of action of the 
applied force $F$. For computing torque, the doorknob may be taken to have zero 
size, so that $F$ acts where $r$ ends. 


The right half shows a door. People from some civilization have just 
invented the door; they have thought about hinges but have not quite 
figured out where to put the doorknob. They say, “As the proud inventors 
of the door, we have complete freedom. Maybe we will place the knob 
right next to the hinges.” Then they realize that something is wrong—they 
are applying a lot of force but not getting anywhere. They finally get it: 
whereas force was everything in linear motion, for rotations, the place- 
ment is relevant. If you want to get your money’s worth, you have to take 
the doorknob as far as you can from the hinges and put it near the other 
end. But they are not done yet. They apply a force on the knob as far as 
possible from the hinges but along the line joining the knob to the hinge, 
the horizontal direction in the figure, and they end up ripping the door off 
the hinges. Eventually it dawns on them: if you want to get some serious 
rotation going, you should really go as far as possible from the pivot point 
and apply the force in the direction perpendicular to the vector joining the 
pivot point and the point of application of the force. All this is encoded in 
the definition of the torque as 


$$\tau = Fr\sin\gamma \tag{9.35}$$


where $\gamma$ is the angle measured from $r$, the vector from the axis to the point of 
application of the force, to the force vector $F$. In both the merry-go-round 
and the door depictions, the torque is positive because $\gamma < \pi$. If we reverse 
$F$, the torque will change sign because $\sin\gamma$ will be negative for $\gamma > \pi$. 

We can read $\tau = Fr\sin\gamma$ in two equivalent ways: $r$ times $F\sin\gamma$, the 
component of the force perpendicular to $r$, the separation from the axis, 
or $F$ times $r\sin\gamma$, the component of $r$ perpendicular to $F$, as indicated in 
the figure. 
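The two readings of $\tau = Fr\sin\gamma$, and the lessons learned by the inventors of the door, can be tried out numerically. The sketch below uses made-up numbers (a 10 N push, a knob 0.8 m from the hinge); the function names are only illustrative.

```python
import math

def torque_force_reading(F, r, gamma):
    """r times the component of F perpendicular to r."""
    return r * (F * math.sin(gamma))

def torque_lever_arm_reading(F, r, gamma):
    """F times the component of r perpendicular to the line of action of F."""
    return F * (r * math.sin(gamma))

# Hypothetical door: 10 N push, knob 0.8 m from the hinge (gamma in radians).
F, r = 10.0, 0.8
tau_perpendicular = torque_force_reading(F, r, math.pi / 2)   # best case
tau_oblique = torque_force_reading(F, r, math.radians(30))    # push at 30 degrees to r
tau_along_r = torque_force_reading(F, r, 0.0)                 # pulling along r: no torque
tau_reversed = torque_force_reading(F, r, math.radians(210))  # F reversed: gamma > pi
tau_near_hinge = torque_force_reading(F, 0.05, math.pi / 2)   # knob next to the hinges
print(tau_perpendicular, tau_oblique, tau_along_r, tau_reversed, tau_near_hinge)
```

Reversing the 30-degree push turns $\gamma$ into 210 degrees, and the torque keeps its size but flips sign.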

Now we turn to the concept of work. Once again, everything can be 
related to what we know for translations, simply reexpressed in terms of 
quantities more pertinent to rotations. Consider a force that acts on one 
of the masses. Its radial part will do no work, because it is trying to change 
$r$, which is impossible if the body is indeed rigid. Let its tangential part 
rotate the body by an angle $d\theta$. The arc length traversed is $r\,d\theta$ and the 
work done is 

$$dW = F_T\,ds = F_T\,r\,d\theta \tag{9.36}$$
$$= \tau\,d\theta \tag{9.37}$$

in accord with our expectations that the product of force and linear 
displacement should be replaced by the product of torque and angular 
displacement. Because $K$ has the same meaning, the work-energy theorem 
becomes 

$$dK = dW = \tau\,d\theta. \tag{9.38}$$


9.5 Torque and the work-energy theorem 


Here is an example of 

$$dW = \tau\,d\theta. \tag{9.39}$$


Figure 9.5 A pendulum made up of a massless string of length $l$ and a bob of mass 
$m$. The force $mg$ is resolved in the radial and tangential directions. The bob has 
climbed up a height $h = l - l\cos\theta$. 


Consider a pendulum that is simply a massless string of length $l$ attached 
at one end to the ceiling and at the other to a bob of mass $m$, as shown 
in Figure 9.5. The angle $\theta$ between the string and the vertical is initially 
0. I want to bring it to an angle $\theta_0$. I want to know how much work is 
required. Consider some intermediate angle $\theta$. The force $T$ supplied by the 
string necessarily points along the string. The gravitational force acting on 
the bob may be resolved into the tangential and radial parts as shown. The 
radial part $mg\cos\theta$ will be balanced by $T$. The tangential part $mg\sin\theta$ has 
to be balanced by a force $F_{me}$ that I must apply to keep it from slipping back 
to the vertical. (I displace the bob without giving it any kinetic energy.) If 
I increase $\theta$ by $d\theta$, this force moves a distance $ds = l\,d\theta$ along an arc. As 
$d\theta \to 0$, this arc becomes a linear segment of the same length, and the 
work done, which is force times distance, becomes 


$$dW = mg\sin\theta\,(l\,d\theta), \tag{9.40}$$

which may equally well be written as the torque times angular displacement: 

$$dW = (mgl\sin\theta)\,d\theta = \tau\,d\theta. \tag{9.41}$$

The work done by me to bring it up to angle $\theta_0$ is 

$$W = \int_0^{\theta_0} mgl\sin\theta\,d\theta = mgl(1 - \cos\theta_0). \tag{9.42}$$

Here is a cross-check. From the figure it is clear the bob has climbed 
up a height $h = l - l\cos\theta_0$ and its potential energy is $U = mgl(1 - \cos\theta_0)$, 
which in turn equals the work done. The kinetic energy was always 0. 
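The same cross-check can be done numerically by adding up $\tau\,d\theta$ over many small steps and comparing with $mgh$. This is a sketch with invented values (a 0.5 kg bob on a 2 m string raised to 60 degrees); the midpoint-rule sum is my choice of method, not something prescribed above.

```python
import math

def work_by_torque(m, g, l, theta0, steps=100_000):
    """Midpoint-rule sum of tau * d(theta), with tau = m*g*l*sin(theta)."""
    dtheta = theta0 / steps
    return sum(m * g * l * math.sin((k + 0.5) * dtheta) * dtheta
               for k in range(steps))

def potential_energy_gain(m, g, l, theta0):
    """m*g*h with h = l - l*cos(theta0)."""
    return m * g * l * (1 - math.cos(theta0))

# Hypothetical values: 0.5 kg bob, 2 m string, raised to 60 degrees.
m, g, l, theta0 = 0.5, 9.8, 2.0, math.radians(60)
W = work_by_torque(m, g, l, theta0)
U = potential_energy_gain(m, g, l, theta0)
print(W, U)
```

The two numbers agree to many decimal places, as Eqn. 9.42 says they must.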

Table 9.1 contains a list of correspondences between angular and 
linear dynamics. 

To this list we just need to add that the tangential displacement $s$, 
velocity $v_T$, and acceleration $a_T$ are simply $r$ times the angular counterparts, 
$\theta$, $\omega$, and $\alpha$. 

Note that all the previously described rotational dynamics simply follow 
from Newton’s laws. 


Table 9.1 Linear and angular quantities 


Entity                        Linear                      Angular 
Displacement                  $x$                         $\theta$ 
Velocity                      $v$                         $\omega$ 
Acceleration                  $a$                         $\alpha$ 
Inertia                       $m$                         $I = \sum_i m_i r_i^2$ 
Momentum                      $p = mv$                    $L = I\omega$ 
Rate of change of momentum    $F = ma = \frac{dp}{dt}$    $\tau = I\alpha = \frac{dL}{dt} = Fr\sin\gamma$ 
Kinetic energy                $\frac{1}{2}mv^2$           $\frac{1}{2}I\omega^2$ 
Work                          $dW = F\,dx$                $dW = \tau\,d\theta$ 


You are now ready to do a variety of simple problems. For example, 
if a force $F_1$ is applied to mass 1 in Figure 9.3, what will be the angular 
acceleration? It is $\tau/I$. We know how to find $I$ by summing $mr^2$ over the 
three masses, and $\tau$ is just $F_{T1}r_1$. Once you have $\alpha$, if it is constant, you 
can compute $\omega$ at later times and so on. There is really just one technical 
obstacle you have to overcome if you want to do rigid body dynamics, and 
that is to know how to compute the moment of inertia for all kinds of 
objects. If I give you 37 masses $m_i$, each with a distance $r_i$ from the point 
of rotation, it’s a trivial thing. But often you are not given a discrete set of 
masses, but a continuous blob. Just as you did in determining the center of 
mass, you have to replace the sums by integrals. 
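As an illustration of the simple problem just described, the following sketch computes $\alpha = \tau/I$ for a three-mass body with a tangential force on mass 1. All numbers are hypothetical, since Figure 9.3 does not supply any.

```python
def angular_acceleration(F_T1, r1, masses, radii):
    """alpha = tau / I, with tau = F_T1 * r1 and I = sum of m_i * r_i^2."""
    I = sum(m * r**2 for m, r in zip(masses, radii))
    return F_T1 * r1 / I

# Hypothetical numbers: masses in kg, rod lengths in m, 6 N tangential force on mass 1.
masses = [1.0, 2.0, 3.0]
radii = [1.0, 0.5, 2.0]
alpha = angular_acceleration(6.0, radii[0], masses, radii)

# With constant alpha and a start from rest, omega = alpha * t (Eqn. 9.17).
omega_after_3s = alpha * 3.0
print(alpha, omega_after_3s)
```

Notice that mass 3, sitting far from the hub, dominates $I$ and so does most of the work of resisting the push.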


9.6 Calculating the moment of inertia 


Let’s take some rigid bodies and try to find their moments of inertia. First 
consider a one-dimensional object, a rod of length $l$ and mass $M$, shown 
in Figure 9.6. Taking the origin at the CM of the rod, we divide it into 
tiny pieces of length $dx$ centered at some $x$, just as we did for the CM 
calculation. The only difference is that now we consider the mass of the 
tiny segment times the square of $x$ rather than $x$: 

$$I_{CM} = \int_{-l/2}^{l/2} \frac{M}{l}\,x^2\,dx \tag{9.43}$$
$$= \frac{M}{l}\left[\frac{x^3}{3}\right]_{-l/2}^{l/2} \tag{9.44}$$
$$= \frac{Ml^2}{12}. \tag{9.45}$$


Figure 9.6 (Top) The moment of inertia of a rod found as the integral over tiny 
segments of width $dx$. (Bottom) The moment of inertia of a disk viewed as a sum 
over contributions from annuli of radius $r$ and thickness $dr$. 


The simplest two-dimensional object to consider is a ring of mass $M$ 
and radius $R$. To find $I_{CM}$, its moment of inertia around the CM, which is 
the center of the circle, imagine dividing it into tiny pieces. Every one of 
the pieces is at the same distance $R$ from the center. So the sum $\sum_i m_i r_i^2$, with 
every $r_i = R$, is just $MR^2$. The mass was spread out, but luckily spread out 
in such a way that every part of it was the same distance $R$ from the center. 

More challenging is the disk of mass $M$ and radius $R$. To find $I_{CM}$, 
we have to organize our thinking. We must think of the disk as made up of 
concentric rings of radius $r$ and thickness $dr$, one of which is shown. The 
mass of this annulus may be found as follows. The area of the annulus is 

$$dA = \pi\left((r + dr)^2 - r^2\right) = 2\pi r\,dr + O(dr^2). \tag{9.46}$$

Its mass is the mass per unit area times the area: 

$$dM = \frac{M}{\pi R^2}\,2\pi r\,dr \tag{9.47}$$

and its contribution to $I_{CM}$ is just this mass $dM$ times its $r^2$: 

$$dI_{CM} = \frac{M}{\pi R^2}\,2\pi r \cdot r^2\,dr \quad \text{which integrates to} \tag{9.48}$$
$$I_{CM} = \int_0^R \frac{M}{\pi R^2}\,2\pi r^3\,dr \tag{9.49}$$
$$= \frac{MR^2}{2}. \tag{9.50}$$

Suppose I made an error somewhere and obtained $I_{CM} = MR^2$. You 
should know it is wrong, because $I_{CM} = MR^2$ is possible only if all the mass 
is at a distance $R$ from the center, but we know some of the mass is a lot 
closer. So the moment of inertia for a disk has to be less than $MR^2$. 
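The annulus construction translates directly into a sum. The sketch below slices a disk into many thin rings, adds up $dM\,r^2$, and compares the result with $MR^2/2$ and with the ring value $MR^2$. The mass and radius are made up, and the number of rings is an arbitrary choice.

```python
def disk_inertia_by_annuli(M, R, n=100_000):
    """Sum dM * r^2 over n annuli, with dM = (M / (pi R^2)) * 2 pi r dr."""
    dr = R / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * dr                 # mid-radius of the k-th annulus
        dM = (M / R**2) * 2 * r * dr       # the factors of pi cancel
        total += dM * r**2
    return total

# Hypothetical disk: 2 kg, radius 0.5 m.
M, R = 2.0, 0.5
I_disk = disk_inertia_by_annuli(M, R)
I_ring = M * R**2                          # all the mass at distance R
print(I_disk, M * R**2 / 2, I_ring)
```

The numerical sum lands on $MR^2/2$ and stays below the ring value, in line with the sanity check above.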


We saw that 

$$I_{CM} = \frac{Ml^2}{12} \tag{9.51}$$

for a rod. If you plan to nail it at its CM and spin it, this is the rotational 
inertia you will be up against. But suppose you want to spin it around the 
left end? We repeat the old calculation of $I_{CM}$, but we measure $x$ from the 
left end and integrate from 0 to $l$: 

$$I_{end} = \int_0^l \frac{M}{l}\,x^2\,dx \tag{9.52}$$
$$= \frac{M}{l}\left[\frac{x^3}{3}\right]_0^l \tag{9.53}$$
$$= \frac{Ml^2}{3}. \tag{9.54}$$

Notice that $I_{end} > I_{CM} = \frac{Ml^2}{12}$. More precisely, 

$$I_{end} = I_{CM} + M\left(\frac{l}{2}\right)^2. \tag{9.55}$$

In fact $I$ will be smallest about the CM. We will prove in the next chapter 
that, in general, the moment of inertia $I$ of a planar object around any 
perpendicular axis that is a distance $d$ away from the CM is $I_{CM} + Md^2$. 
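Before the proof, the claim can at least be tested on the uniform rod. This sketch integrates $(M/l)(x - x_p)^2\,dx$ numerically for a pivot at an arbitrary point $x_p$ measured from the left end. The rod's mass and length, the pivot locations, and the name `rod_inertia_about` are all illustrative choices.

```python
def rod_inertia_about(M, l, pivot, n=100_000):
    """Midpoint-rule sum of (M/l) * (x - pivot)^2 * dx for x from 0 to l."""
    dx = l / n
    return sum((M / l) * ((k + 0.5) * dx - pivot) ** 2 * dx for k in range(n))

# Hypothetical uniform rod: 2 kg, 3 m long, x measured from the left end.
M, l = 2.0, 3.0
I_cm = rod_inertia_about(M, l, pivot=l / 2)       # expect M l^2 / 12
I_end = rod_inertia_about(M, l, pivot=0.0)        # expect M l^2 / 3
I_quarter = rod_inertia_about(M, l, pivot=l / 4)  # a distance d = l/4 from the CM
print(I_cm, I_end, I_quarter)
```

In each case the result matches $I_{CM} + Md^2$ with $d$ the distance of the pivot from the midpoint.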


CHAPTER 10 


Rotational Dynamics II 


10.1 The parallel axis theorem 


Let us recall what we have learned about rigid bodies that are confined 
to lie and rotate in a plane, such as a rod or a sheet of some metal cut 
out into some arbitrary shape. The body has a mass M. It can translate 
and rotate, but for now we nail a point on it to the plane and let it rotate 
about that axis, with plans to bring in translations later on. A single angle 
$\theta$, measured in radians, suffices to tell us what it is doing, because all it 
can do is rotate about the fixed point. This angle $\theta$ is to rotations what $x$ 
was to translations. There is a corresponding angular velocity $\omega = \frac{d\theta}{dt}$ and 
acceleration $\alpha = \frac{d\omega}{dt} = \frac{d^2\theta}{dt^2}$. The novel attribute, owing to the object being 
extended and not point-like, is its moment of inertia $I$. If we imagine it 
as being made of point masses $m_i$ sitting at a distance $r_i$ from the axis, 
we find 

$$I = \sum_i m_i r_i^2. \tag{10.1}$$


Note that I is a quantity that depends on how the mass is distributed rela- 
tive to the axis about which it is rotating. It plays the role that mass did in 
translational motion. If the body is continuous, the sum is replaced by the 
corresponding integral. The angular momentum $L$ 

$$L = I\omega \tag{10.2}$$

plays the role of momentum $p = mv$, and $F = ma = \frac{dp}{dt}$ is replaced by 

$$\tau = I\alpha = \frac{dL}{dt} \tag{10.3}$$

where the torque is defined by 

$$\tau = \sum_i r_i F_i \sin\gamma_i \tag{10.4}$$

where $\gamma_i$ is the angle measured from $\mathbf{r}_i$, the vector from the point of 
rotation to the point of application of the force, to the force vector $\mathbf{F}_i$. We may 
rewrite the expression for torque in two other useful ways: 

$$\tau = F_\perp r = F r_\perp \tag{10.5}$$

where $F_\perp$ is the component of the force perpendicular to $r$ and $r_\perp$ is the 
component of $r$ perpendicular to $F$. 

In our convention a torque was positive (negative) if it implied 
counterclockwise (clockwise) acceleration. 

The kinetic energy is 

$$K = \frac{1}{2}I\omega^2. \tag{10.6}$$

All the rotational equations, say the one for $K$, are simply the results from 
linear motion as applied to the constituents of the rigid body. 
Finally, at each point in the body, the tangential displacement, 
velocity, and acceleration are just $r$ times the angular ones: 

$$s = r\theta \qquad v_T = r\omega \qquad a_T = r\alpha. \tag{10.7}$$

We practiced computing moments of inertia of various objects (rod 
and disk) and concluded with the observation that for a rod 

$$I_{end} = I_{CM} + M\left(\frac{l}{2}\right)^2. \tag{10.8}$$


We will now see that this is an example of a more general result: 

The parallel axis theorem: The moment of inertia of a planar body 
about any axis perpendicular to its plane is 

$$I = I_{CM} + Md^2 \tag{10.9}$$

where $d$ is the distance from the CM to the new axis. (In the rod example 
$d = \frac{l}{2}$.) 

Here is the proof. As a warm-up, we will do it in one dimension for a rod 
of a given linear mass density $\rho(x)$, that is, where $\rho(x)\,dx$ is the mass of 
an infinitesimal segment of width $dx$ at the point $x$. Let us choose as our 
origin the CM, and measure $x$ from it. Let the new axis pass through the 
point $x = d$, as shown in Figure 10.1. The moment of inertia about the new 
axis is 

$$I = \int_{Rod} \rho(x)(x - d)^2\,dx \tag{10.10}$$
$$= \int_{Rod} \rho(x)(x^2 + d^2 - 2xd)\,dx \tag{10.11}$$
$$= \int_{Rod} \rho(x)\,x^2\,dx + d^2\int_{Rod} \rho(x)\,dx - 2d\int_{Rod} \rho(x)\,x\,dx \tag{10.12}$$
$$= I_{CM} + Md^2 - 2d\int_{Rod} \rho(x)\,x\,dx. \tag{10.13}$$
Rod 


The first two terms are welcome: they are required in the theorem. The 
cross term is like a left-over part after you have assembled your bookshelf 
from Ikea. We need to get rid of it. Luckily it is zero, for the following 
reason. By definition 

$$\frac{\int_{Rod} \rho(x)\,x\,dx}{M} = X, \quad \text{that is,} \quad \int_{Rod} \rho(x)\,x\,dx = MX \tag{10.14}$$

where $X$ is the CM. What is $X$? Our origin is at the CM itself and with 
respect to that origin, the CM has zero coordinate, $X = 0$. 


Figure 10.1 (Top) The moment of inertia calculation about an axis a distance $d$ 
from the CM, which is not necessarily at the midpoint of the rod if the density is 
not uniform. The rod is made up of segments of width $dx$, one of which is shown 
at a distance $x$ from the CM and $x - d$ from the new axis. (Bottom) The parallel 
axis now passes through a point with a vector separation $\mathbf{d}$ from the CM. The 
body is made up of tiny areas of size $dx\,dy$, one of which is shown, separated by $\mathbf{r}$ 
from the CM and $\mathbf{r}'$ from the new parallel axis. 


Thus, the moment of inertia is the smallest with respect to the CM; 
any other axis adds an $Md^2$. 

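Now let us do it in two dimensions. I remind you that the length squared of 
$\mathbf{A} + \mathbf{B}$ is 

$$|\mathbf{A} + \mathbf{B}|^2 = (\mathbf{A} + \mathbf{B})\cdot(\mathbf{A} + \mathbf{B}) = \mathbf{A}\cdot\mathbf{A} + \mathbf{B}\cdot\mathbf{B} + 2\mathbf{A}\cdot\mathbf{B} = A^2 + B^2 + 2\mathbf{A}\cdot\mathbf{B}. \tag{10.15}$$

Let $\rho(x, y)$ be the mass per unit area, let $\mathbf{r}$ be the position vector of a point 
with the origin at the CM, and let $\mathbf{d}$ be the location of the new axis. Clearly 
the position vector of a point relative to the new axis is 

$$\mathbf{r}' = \mathbf{r} - \mathbf{d}. \tag{10.16}$$

Now we repeat the earlier proof with vectors galore: 

$$I = \int_{body} \rho(x, y)\,|\mathbf{r} - \mathbf{d}|^2\,dx\,dy \tag{10.17}$$
$$= \int_{body} \rho(x, y)\,(r^2 + d^2 - 2\mathbf{d}\cdot\mathbf{r})\,dx\,dy \tag{10.18}$$
$$= \int_{body} \rho(x, y)\,r^2\,dx\,dy + d^2\int_{body} \rho(x, y)\,dx\,dy - 2\mathbf{d}\cdot\int_{body} \rho(x, y)\,\mathbf{r}\,dx\,dy \tag{10.19}$$
$$= I_{CM} + Md^2 - 2M\,\mathbf{d}\cdot\mathbf{R} \tag{10.20}$$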

where the last term vanishes for the same reason as in d = 1: R is the CM 
position in a coordinate system with the CM itself as the origin. 
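The theorem is easy to test numerically on a body made of point masses, where the integrals become sums. In this sketch the fifty masses and their positions are random and purely illustrative; the new axis is displaced from the CM by the vector (0.3, -0.4), so $d = 0.5$.

```python
import random

def center_of_mass(masses, points):
    """CM of point masses in the plane."""
    M = sum(masses)
    X = sum(m * x for m, (x, y) in zip(masses, points)) / M
    Y = sum(m * y for m, (x, y) in zip(masses, points)) / M
    return X, Y

def inertia_about(masses, points, axis):
    """Sum of m_i * |r_i - axis|^2 for an axis perpendicular to the plane."""
    ax, ay = axis
    return sum(m * ((x - ax) ** 2 + (y - ay) ** 2)
               for m, (x, y) in zip(masses, points))

random.seed(0)  # arbitrary seed so the run is repeatable
masses = [random.uniform(0.1, 2.0) for _ in range(50)]
points = [(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(50)]

cm = center_of_mass(masses, points)
new_axis = (cm[0] + 0.3, cm[1] - 0.4)       # displaced from the CM by d = 0.5
I_cm = inertia_about(masses, points, cm)
I_new = inertia_about(masses, points, new_axis)
I_theorem = I_cm + sum(masses) * 0.5 ** 2
print(I_new, I_theorem)
```

The direct sum and the theorem agree to rounding error, whatever the shape of the body, because the cross term is proportional to the CM position measured from the CM.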

Here is an illustration of the power of this result. Suppose that in- 
stead of rotating a disk about its center you wanted to hold fixed a point 
at its circumference. We cannot view the disk as a union of concentric 
annuli centered around the new axis as we did for $I_{CM}$. Only a part of 
every annulus would fit into the disk, and we would need to figure out 
how much, if we want to compute $I$ directly. Of course, we will do no 
such thing: we will invoke the parallel axis theorem to say, for a disk of 
radius $R$, 

$$I_{edge} = I_{CM} + MR^2 = \frac{3MR^2}{2}. \tag{10.21}$$

Consider for example a coin that is standing on its rim on some surface. It 
has just one point of contact with the surface. If we demand that there be 
no slipping, the point of contact cannot move relative to the surface as it 
rolls. So the coin will simply rotate around this point (at this instant) and 
the relevant $I$ will be $I_{edge} = \frac{3MR^2}{2}$. 


10.2 Kinetic energy for a general N-body system 


Now let’s turn to a result involving kinetic energy, whose derivation has a 
flavor similar to the previous one, which is why I want to present it 
side-by-side. Imagine a whole collection of masses $m_i$, not necessarily forming a 
rigid body. For example, they could be stars forming a galaxy. The formula 
for the kinetic energy of this collection is 

$$K = \sum_i \frac{1}{2} m_i v_i^2, \tag{10.22}$$

where $v_i$ are the magnitudes of the velocities $\mathbf{v}_i$ as seen by some generic 
observer. These velocities will of course depend on the frame of reference 
of the observer. Let $\mathbf{V}_{CM}$ be the velocity of the CM as seen by this observer. 
Consider a special observer who is co-moving with the CM. To her the CM 
is at rest. If $\mathbf{v}_i^{CM}$ is the velocity of particle $i$ as seen by this co-moving 
observer, then by the law of composition of velocities derived in Chapter 2 
(Eqn. 2.44), 

$$\mathbf{v}_i = \mathbf{V}_{CM} + \mathbf{v}_i^{CM}. \tag{10.23}$$

In this notation, the superscript CM means “in the frame moving with the 
CM.” No superscript means in the original generic frame, and the subscript 
CM or $i$ means “of the CM” or “of the particle $i$.” Thus $\mathbf{v}_i^{CM}$ is the velocity 
of particle $i$ in the CM frame and $v_i$ is the speed of particle $i$ in the 
original generic frame. We now manipulate Eqn. 10.22 as follows: 


$$K = \frac{1}{2}\sum_i m_i v_i^2 \tag{10.24}$$
$$= \frac{1}{2}\sum_i m_i \left|\mathbf{V}_{CM} + \mathbf{v}_i^{CM}\right|^2 \tag{10.25}$$
$$= \frac{1}{2}\sum_i m_i V_{CM}^2 + \frac{1}{2}\sum_i m_i \left(v_i^{CM}\right)^2 + \mathbf{V}_{CM}\cdot\sum_i m_i \mathbf{v}_i^{CM} \tag{10.26}$$
$$= \frac{1}{2}MV_{CM}^2 + K^{CM} + 0. \tag{10.27}$$

The first term is the kinetic energy of the center of mass, a point 
containing the entire mass of the object and moving with a velocity $\mathbf{V}_{CM}$. 
The second term is the $K$ as measured by the co-moving observer who 
uses the velocities $\mathbf{v}_i^{CM}$ for particle $i$. If you were really traveling with the 
center of mass, and you made a measurement on how fast every particle 
in it was moving, this would be the kinetic energy you would attribute to 
these particles. The third term in Eqn. 10.26 vanishes because 

$$\sum_i m_i \mathbf{v}_i^{CM} = 0. \tag{10.28}$$
i 


Here is the reason. In any generic frame, by definition, 


$$M\mathbf{R} = \sum_i m_i \mathbf{r}_i. \tag{10.29}$$

Taking the time-derivative of both sides we find, also in any frame, 

$$M\mathbf{V}_{CM} = \sum_i m_i \mathbf{v}_i. \tag{10.30}$$

Now apply this to the observer co-moving with the CM: 

$$M\mathbf{V}_{CM}^{CM} = \sum_i m_i \mathbf{v}_i^{CM}. \tag{10.31}$$

The left-hand side vanishes because $\mathbf{V}_{CM}^{CM}$ is the velocity of the CM as seen 
by a person riding with the CM. This means the right-hand side, which 
appears in the last term in Eqn. 10.26, also vanishes. 

Equation 10.27 is valid for any collection of objects, whether they 
be parts of a rigid body or stars in a galaxy. It is going to be very useful 
to us when we study rigid bodies that rotate and translate. In that con- 
text, it means that as far as the kinetic energy goes, we may simply add 
the $K$ of the translational motion of the CM to the $K$ of rotations about 
the CM: 

$$K = K_{CM} + K_{rot}. \tag{10.32}$$
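Equation 10.27 holds for any collection of masses, so it can be checked on a random one. In the sketch below the twenty masses and their velocities are arbitrary; the code computes the three terms of Eqn. 10.26 separately.

```python
import random

random.seed(1)  # arbitrary seed so the run is repeatable
masses = [random.uniform(0.5, 3.0) for _ in range(20)]
velocities = [(random.uniform(-5, 5), random.uniform(-5, 5)) for _ in range(20)]

M = sum(masses)
V_cm = (sum(m * vx for m, (vx, vy) in zip(masses, velocities)) / M,
        sum(m * vy for m, (vx, vy) in zip(masses, velocities)) / M)

def kinetic_energy(masses, velocities):
    """Sum of (1/2) m_i v_i^2."""
    return sum(0.5 * m * (vx**2 + vy**2) for m, (vx, vy) in zip(masses, velocities))

# Velocities as seen by the observer co-moving with the CM (Eqn. 10.23 rearranged).
v_rel = [(vx - V_cm[0], vy - V_cm[1]) for vx, vy in velocities]

K_total = kinetic_energy(masses, velocities)
K_of_cm = 0.5 * M * (V_cm[0] ** 2 + V_cm[1] ** 2)
K_in_cm_frame = kinetic_energy(masses, v_rel)
cross_term = sum(m * (V_cm[0] * ux + V_cm[1] * uy) for m, (ux, uy) in zip(masses, v_rel))
print(K_total, K_of_cm + K_in_cm_frame, cross_term)
```

The cross term comes out as zero up to rounding, so the total is just the sum of the other two.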


10.3 Simultaneous translations and rotations 


Now consider a disk, say a tire in a car. There are different things you could 
imagine. Lift the car off the ground, and let the tires spin; that has 
rotational energy $K = K_{rot} = \frac{1}{2}I_{CM}\omega^2$. That’s the pure rotation we have been 
studying so far. Next, let the tire hit the ground and set the car in motion. 
Suddenly slam the brakes and prevent the tire from turning. The skidding 
tire still moves with a certain velocity, which is due to just the 
translational motion of the center of mass, with $K = K_{CM} = \frac{1}{2}MV_{CM}^2$. This too is 
familiar, just like that of a point particle. 

In general, the tire will rotate around the center and also 
translate. The linear velocity of the center of mass and the angular velocity 
around the center don’t have to be connected in any way. Suppose you 
start your car on an icy road. The tire is spinning, but the car is not 
moving. So, that’s a case of $V_{CM} = 0$, $\omega \neq 0$. In the case of a skid, we have 
$V_{CM} \neq 0$, $\omega = 0$. We are interested in the case where there is no slipping 
relative to the ground; the wheels are not burning rubber. Now there is a 
correlation between the angular velocity and the linear velocity, and the 
motion is called “rolling without slipping.” When the tire rolls without 
slipping, by the time it finishes one full revolution, the car will have moved 
the distance equal to the circumference of the tire. So every part of the 
tire touches the ground once, while the car moves a distance equal to the 
circumference. 

Let us calculate the velocity when the tire rolls without slipping. If it 
makes $f$ revolutions per second and in each revolution the car moves $2\pi R$ 
meters, then 

$$V_{CM} = 2\pi R f = \omega R \tag{10.33}$$

where I have used the fact that $\omega = 2\pi f$. Now we have 

$$K = \frac{1}{2}MV_{CM}^2 + \frac{1}{2}I_{CM}\omega^2 \quad \text{because of Eqn. 10.27} \tag{10.34}$$
$$= \frac{1}{2}M\omega^2R^2 + \frac{1}{2}\,\frac{MR^2}{2}\,\omega^2 = \frac{1}{2}\,\frac{3MR^2}{2}\,\omega^2 \tag{10.35}$$
$$= \frac{1}{2}I_{edge}\,\omega^2 \tag{10.36}$$

because of Eqn. 10.21, the parallel axis theorem. 

Figure 10.2 A tire of radius $R$ that is rolling clockwise at $\omega$ without slipping. 
At the contact point $P$, the CM velocity $V_{CM}$ cancels the tangential velocity $\omega R$, 
while at the top, $T$, it doubles it. The point $P$ is instantaneously at rest with 
respect to the road. 


This very interesting result tells you that you can view the entire 
energy of the tire as due to pure rotation with angular velocity $\omega$ around 
the point of contact $P$ in Figure 10.2. If the tire is indeed rotating around 
that point, it means that point cannot be moving. And I will now try to 
convince you. How fast are the different parts of the tire moving? The car 
as a whole is moving at $V_{CM}$. In addition, the tire is spinning. To find the 
velocity of any point on the circumference, you should add the tangential 
velocity of magnitude $\omega R$ to the CM velocity. At $P$, the point of contact, the 
two cancel, while at the top $T$, they add to give $2V_{CM}$. Thus, in a car going 
past you at 200 miles per hour, there is a part with zero velocity and a part 
with a velocity 400 miles per hour relative to the ground. It’s not obvious 
that a zooming car has one part of it that’s not moving at all. But at every 
instant, the part of the tire that touches the ground has zero instantaneous 
velocity. 
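The addition of velocities at the rim can be written out for any point, not just $P$ and $T$. In this sketch the tire rolls to the right, the angle `phi` is measured from the top of the tire in the direction of rotation, and the tire radius of 0.3 m is made up; this parametrization is my own choice and is not taken from Figure 10.2.

```python
import math

def rim_point_velocity(V_cm, R, phi):
    """Velocity (vx, vy) relative to the road of the rim point at angle phi,
    measured clockwise from the top T, for a tire rolling to the right
    without slipping (so omega = V_cm / R)."""
    omega = V_cm / R
    # The point sits at (R sin(phi), R cos(phi)) relative to the center; a
    # clockwise spin gives it the tangential velocity omega*R*(cos(phi), -sin(phi)).
    vx = V_cm + omega * R * math.cos(phi)
    vy = -omega * R * math.sin(phi)
    return vx, vy

# The car of the text: 200 miles per hour; the tire radius (0.3 m) is hypothetical.
top = rim_point_velocity(200.0, 0.3, 0.0)
bottom = rim_point_velocity(200.0, 0.3, math.pi)
front = rim_point_velocity(200.0, 0.3, math.pi / 2)
print(top, bottom, front)
```

The top comes out at twice the car's speed, the contact point at zero, and the front-most point moves forward and downward at $\sqrt{2}$ times the car's speed.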


10.4 Conservation of energy 


Suppose I release an object of mass $M$ from the top of a hill of height $h$. If 
it is a point mass, I know its speed when it comes down: 

$$Mgh = \frac{1}{2}Mv^2 \tag{10.37}$$
$$v = \sqrt{2gh}. \tag{10.38}$$

But if it is a coin of radius $R$ that is rolling without slipping, when it reaches 
the bottom the center of mass cannot move at $V_{CM} = \sqrt{2gh}$ because it has 
to be rotating about its axis to avoid slipping. Some rotational energy is 
mandated. So now we write 

$$Mgh = \frac{1}{2}MV_{CM}^2 + \frac{1}{2}I_{CM}\omega^2 \tag{10.39}$$
$$= \frac{1}{2}\,\frac{3MR^2}{2}\,\omega^2 = \frac{1}{2}\,\frac{3M}{2}\,\omega^2R^2 = \frac{3M}{4}V_{CM}^2 \tag{10.40}$$
$$V_{CM} = \sqrt{\frac{4gh}{3}} < \sqrt{2gh}. \tag{10.41}$$

This formula for the disk (of zero thickness) is also valid for a cylinder 
rolling about its symmetry axis, because the cylinder may be viewed as a 
coaxial stack of disks whose moments of inertia add. 

For a solid sphere, we can do this again using $I_{CM} = \frac{2}{5}MR^2$ and the 
parallel axis theorem. How about a hollow sphere of the same mass? Argue 
that it will have a greater moment of inertia. The exact answer is $\frac{2}{3}MR^2$. If 
I roll a hollow sphere and a solid sphere of the same mass and radius, the 
latter will come down faster. 
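The race between different shapes can be summarized in one formula. If we write $I_{CM} = kMR^2$ (the symbol $k$ is my shorthand, not the text's), then Eqn. 10.39 with $\omega = V_{CM}/R$ gives $V_{CM} = \sqrt{2gh/(1 + k)}$. The sketch below evaluates it for the shapes just mentioned, with an arbitrary height of 1 m.

```python
import math

def rolling_speed(g, h, k):
    """CM speed after descending a height h while rolling without slipping,
    for a body with I_CM = k * M * R^2. It follows from
    M g h = (1/2) M V^2 + (1/2) (k M R^2) (V / R)^2."""
    return math.sqrt(2 * g * h / (1 + k))

# k values from the text: disk or cylinder 1/2, solid sphere 2/5, hollow sphere 2/3.
shapes = {
    "point mass (no rotation)": 0.0,
    "solid sphere": 2 / 5,
    "disk or cylinder": 1 / 2,
    "hollow sphere": 2 / 3,
}
speeds = {name: rolling_speed(9.8, 1.0, k) for name, k in shapes.items()}
for name, v in speeds.items():
    print(f"{name:26s} {v:.3f} m/s")
```

Neither $M$ nor $R$ appears in the answer: only the way the mass is distributed, through $k$, decides the winner.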

You can readily imagine all kinds of new problems. You remember 
that in the loop-the-loop, the velocity squared of a point mass at the top 
of the loop has to be bigger than or equal to Rg if it is to follow the track. 
Suppose the object in question is a cylinder or sphere. The condition on the 
velocity at the top is still the same, but the height from which you release 
it has to be greater, because if it rolls without slipping, it also has to have a 
rotational energy correlated to its translational energy. 

To summarize, when rigid bodies move, they have translational and 
rotational energy. In general, they are independent numbers, but when you 
have rolling without slipping, the angular velocity and the linear velocity 
are connected by $V_{CM} = \omega R$. So, it’s not surprising that the total energy 
has a contribution from both, which you can write either in terms of the 
angular velocity or the linear velocity. In other words, if I know how fast 
the wheel is spinning, I can tell you how fast the car is moving. If I tell 
you how fast the car is moving, I know the rate at which the wheels are 
spinning. 


10.5 Rotational dynamics using $\tau = \frac{dL}{dt}$ 


Let me take a simple example. A group of rogue toddlers decided their 
parents were not pushing their merry-go-round of mass $M$ fast enough, and 
so they decided to use rocket propulsion. Initially they had only one rocket 
of mass $m$, but it nearly blew the axle holding the platform. While the 
rocket did provide a torque, it also provided a force that would have caused 
linear acceleration of the entire system, but for the axle, which applied a 
counterforce. To avoid this, they got a second identical rocket and installed 
it diametrically opposite to the first, with its thrust oriented to assist the 
first as in Figure 10.3. The engines have a thrust $F$ each and together they 
apply a torque $\tau = 2FR$. This will produce an angular acceleration 

$$\alpha = \frac{\tau}{I} = \frac{2FR}{\frac{1}{2}MR^2 + 2mR^2}. \tag{10.42}$$


Figure 10.3 A rocket-powered merry-go-round of mass $M$. Each engine has a 
thrust $F$, mass $m$, and contributes $mR^2$ to the moment of inertia. 


Given this, we can manufacture any number of trivial problems. For 
example, how many revolutions would the merry-go-round have completed 
after time $t$ if it started from rest? It would have rotated by 

$$\theta(t) = \frac{1}{2}\alpha t^2 \text{ radians}, \tag{10.43}$$

that is, by $\theta(t)/2\pi$ revolutions. 
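One such trivial problem, worked with made-up numbers (50 N of thrust per engine, a 100 kg platform of radius 2 m, 5 kg rockets), might look like this. The function names are illustrative.

```python
import math

def merry_go_round_alpha(F, R, M, m):
    """alpha = tau / I with tau = 2 F R and I = (1/2) M R^2 + 2 m R^2 (Eqn. 10.42)."""
    return 2 * F * R / (0.5 * M * R**2 + 2 * m * R**2)

def revolutions(alpha, t):
    """Revolutions completed after time t, starting from rest (Eqn. 10.43 over 2*pi)."""
    return 0.5 * alpha * t**2 / (2 * math.pi)

# Hypothetical numbers: 50 N thrust per engine, R = 2 m, M = 100 kg, m = 5 kg.
alpha = merry_go_round_alpha(50.0, 2.0, 100.0, 5.0)
revs = revolutions(alpha, 10.0)
print(alpha, revs)
```

Dropping the rockets' own $2mR^2$ from $I$ would overestimate $\alpha$, in this example by 20 percent.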


10.6 Advanced rotations 


Now we consider slightly more complicated problems. A block of mass m 
is supported by a massless rope that wraps around a pulley of radius R and 
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mg 


Figure 10.4 Free-body diagram of a block of mass m supported by a massless 
rope that wraps around a pulley of radius R and mass M. 


mass M as shown in Figure 10.4. We want to find out with what acceler- 
ation the block descends. Let T be the tension in the rope, as shown in 
the free-body diagram. We have the following equations of motion for the 
block ($F = ma$) and the pulley ($\tau = I\alpha$) and the no-slip condition relating
$a$ and $\alpha$:

$$mg - T = ma \tag{10.44}$$
$$TR = I\alpha = \frac{1}{2}MR^2\alpha \tag{10.45}$$
$$a = R\alpha. \tag{10.46}$$


I have chosen the down direction as positive for the block and clockwise as 
positive for the torque, in contrast to the usual. This should prepare you 
for dealing with unconventional choices. 

I have not shown the force exerted by the pivot supporting the pulley 
because this force does not contribute to the torque about the pivot: $r_\perp = 0$.
In Eqn. 10.46 I have related the angular acceleration of the pulley to the 
linear acceleration of the block. The mass can go down one inch only if 
the pulley turns by a corresponding amount to release one more inch of 
rope. By taking derivatives, we equate the tangential acceleration $\alpha R$ to $a$,
the linear acceleration of the mass. 

In Eqn. 10.45, I have used $I_{CM} = \frac{1}{2}MR^2$. Canceling one power of $R$ in
that equation and adding it to Eqn. 10.44 we find

$$mg = ma + \frac{1}{2}MR\alpha = ma + \frac{1}{2}Ma \tag{10.47}$$
$$a = g\,\frac{m}{m + \frac{M}{2}}. \tag{10.48}$$


We can check our result as follows. If the block drops a height h, then 
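its final velocity is, from the relation $v^2 = v_0^2 + 2ad$ with $v_0 = 0$,

$$v^2 = 2ah = 2g\,\frac{m}{m + \frac{M}{2}}\,h, \tag{10.49}$$

which can be rearranged to give

$$\frac{1}{2}mv^2 + \frac{1}{2}\,\frac{M}{2}\,v^2 = mgh \tag{10.50}$$
$$\frac{1}{2}mv^2 + \frac{1}{2}\,\frac{MR^2}{2}\,\omega^2 = mgh, \tag{10.51}$$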


which respects energy conservation. 
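
The energy check above can also be run numerically. The following is a minimal sketch, assuming a uniform disk pulley and a block released from rest; the function names and sample values are illustrative assumptions, not part of the text.

```python
def block_acceleration(m, M, g=9.8):
    """Eqn. 10.48: a = g m / (m + M/2)."""
    return g * m / (m + 0.5 * M)

def energy_balance(m, M, R, h, g=9.8):
    """Return (kinetic energy gained, potential energy lost) after a drop h from rest."""
    a = block_acceleration(m, M, g)
    v = (2.0 * a * h) ** 0.5           # v^2 = 2 a h
    omega = v / R                      # no-slip condition
    I = 0.5 * M * R**2                 # moment of inertia of the disk pulley
    kinetic = 0.5 * m * v**2 + 0.5 * I * omega**2
    potential = m * g * h
    return kinetic, potential

# Hypothetical numbers in SI units.
print(energy_balance(m=2.0, M=4.0, R=0.1, h=1.5))
```

The two returned numbers should agree up to rounding, whatever radius R is chosen, since R cancels between the moment of inertia and the no-slip condition.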

Again we can add more complications. We could go back to the prob- 
lem of one block sliding down the incline and couple it to another block 
moving vertically. We could decide to assign a non-zero moment of inertia 
to the pulley. 


10.7 Conservation of angular momentum 


So far, we have looked at examples of $\tau = I\alpha$ with $\tau \neq 0$. Now I’m going to
consider problems where the total torque on a collection of bodies is zero.
Then because $\tau = \frac{dL}{dt}$, the total angular momentum will be conserved.

Here’s one example. There is a disk spinning around a spindle passing
through a hole at its center. The disk has some moment of inertia $I_1$
and angular velocity $\omega_1$. Infinitesimally above this disk, and on the same
spindle, is a second non-rotating disk with moment of inertia $I_2$. If the
upper disk falls on the lower one and the two rotate as one unit, what will
be the final $\omega$? Angular momentum conservation tells us

$$I_1\omega_1 + I_2 \cdot 0 = (I_1 + I_2)\,\omega \tag{10.52}$$
$$\omega = \frac{I_1}{I_1 + I_2}\,\omega_1. \tag{10.53}$$


This should remind you of the problem in translational motion 
where $m_1$ slams into $m_2$, which is at rest, and the two move together as
a unit following the totally inelastic collision. 

Here is a similar problem. A merry-go-round of radius R is spinning
at $\omega_1$ and has angular momentum $L_1 = I_1\omega_1$ where $I_1 = \frac{1}{2}MR^2$. I drop a
point kid of mass m on its outer rim. The kid and the merry-go-round now
rotate together. What will be the final $\omega$? The kid has to be brought from
rest to the tangential speed of the rim. The merry-go-round will apply a
force on the kid to speed up the kid, and the kid will exert an equal and
opposite force that will slow the merry-go-round. The torques will be equal
and opposite. So we may use the conservation of angular momentum for
the entire system, to find $\omega$:

$$I_1\omega_1 = (I_1 + mR^2)\,\omega \tag{10.54}$$
$$\omega = \frac{I_1\omega_1}{I_1 + mR^2} \tag{10.55}$$


where $mR^2$ is the moment of inertia of the kid sitting at the edge.
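
Both examples are the same calculation with different values for the added moment of inertia. A minimal sketch, with function names and numbers that are assumptions made for illustration:

```python
def final_omega(I1, omega1, I2):
    """Eqn. 10.53: a body with (I1, omega1) sticks to a non-rotating body with I2."""
    return I1 * omega1 / (I1 + I2)

def kid_on_rim(M, R, omega1, m):
    """Eqn. 10.55: disk of mass M and radius R, point kid of mass m dropped on the rim."""
    I1 = 0.5 * M * R**2
    return final_omega(I1, omega1, m * R**2)

# Hypothetical numbers in SI units.
print(final_omega(I1=2.0, omega1=3.0, I2=2.0))
print(kid_on_rim(M=100.0, R=2.0, omega1=1.0, m=25.0))
```

Treating the kid as just another moment of inertia, $mR^2$, is what lets one function serve both problems.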


10.8 Angular momentum of the figure skater 


Consider a single particle with momentum p = mv. If there are no forces, 
the momentum mv cannot change, and that translates into a constant v. If 
the body could reduce its mass in half, it could double its velocity keeping 
the momentum constant. But bodies have no way of changing their mass. 
But if they are not rigid, they can change their moment of inertia, by
rearranging their mass distribution. This way they can change their $\omega$, keeping
the product $L = I\omega$ constant.

Consider a figure skater with her hands extended outward, spinning 
at some rate, as shown in the left half of Figure 10.5. She is a rigid body with
some $I_1$ and $\omega_1$. Then the skater briefly becomes non-rigid as she raises her




[Figure 10.5 labels. Left: $(I_1, \omega_1)$. Right: $(I_2 = 0.5\,I_1,\ \omega_2 = 2\omega_1)$.]


Figure 10.5 Left: A spinning figure skater with her hands extended outward. 
Right: Raising her arms over her head, the skater has reduced her moment of 
inertia and increased her angular velocity, without changing their product, her 
conserved angular momentum. 


hands over her head, which reduces her moment of inertia to $I_2$ and changes
her angular frequency to $\omega_2$ such that

$$I_1\omega_1 = I_2\omega_2. \tag{10.56}$$


If she extends her arms out again, she slows down. Though she is non- 
rigid during some of this exercise, her L is conserved because there are no 
external torques. 

However, her kinetic energy changes in this process. For example, let 
her initial moment of inertia with arms extended be double the final one 
with arms drawn over her head: $I_1 = 2I_2$. This means

$$\omega_2 = 2\omega_1 \tag{10.57}$$
$$K_2 = \frac{1}{2}I_2\omega_2^2 = \frac{1}{2}\,\frac{I_1}{2}\,(2\omega_1)^2 = 2K_1. \tag{10.58}$$


The final kinetic energy is double the initial one. While the halving of
$I$ compensates the doubling of $\omega$ where angular momentum is concerned,
this is not so for $K$, which is quadratic in $\omega$. Where does the extra
energy come from? It comes from her muscles that apply the centripetal
force as she pulls in her arms. The centripetal force is always there to
keep her extended arms moving in a circle, but it does work only when
her arms move radially, for that is when $dW = \mathbf{F}\cdot d\mathbf{r} \neq 0$. (We ignore the
gravitational work needed to raise her arms.)
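
The bookkeeping for the skater can be sketched in a few lines of Python. The function name and the numbers below are assumptions for illustration only.

```python
def spin_after_pull(I1, omega1, I2):
    """Conserve L = I * omega, then compare kinetic energies before and after."""
    omega2 = I1 * omega1 / I2          # Eqn. 10.56 solved for omega2
    K1 = 0.5 * I1 * omega1**2
    K2 = 0.5 * I2 * omega2**2
    return omega2, K1, K2

# Hypothetical skater: moment of inertia halves when the arms are raised.
omega2, K1, K2 = spin_after_pull(I1=4.0, omega1=3.0, I2=2.0)
print(omega2, K1, K2)
```

With $I_1 = 2I_2$ the angular velocity doubles and so does the kinetic energy, as in Eqns. 10.57 and 10.58; the difference $K_2 - K_1$ is the work done by her muscles.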


CHAPTER 11


Rotational Dynamics III 


11.1 Static equilibrium 


I’m going to consider cases where there is no external torque. If there’s no 
torque, we know the angular velocity is constant. But I’m going to take
a case where this constant value of angular velocity is itself zero. There is 
no motion, there is no torque. So you might say, “What’s there to study?” 
Well, sometimes it’s of great interest to us to know that the object has no 
angular velocity, for example, if the object is a ladder we have climbed. The 
ladder better not have any angular acceleration either. What does it take to 
keep the ladder from falling over? That’s the kind of problem we’re going 
to discuss. 

Obviously the forces on the ladder should add up to zero; otherwise 
F = ma will ensure the CM accelerates. But F = 0 is not good enough, as is 
clear from the rocket-propelled merry-go-round: the thrusts are equal and 
opposite and cancel as forces, but the torques they produce add. 

Because the cause of rotation is the torque, we want the total torque 
to vanish as well. So the condition for static equilibrium of a rigid body is 


$$\sum_i F_{ix} = 0 \tag{11.1}$$
$$\sum_i F_{iy} = 0 \tag{11.2}$$
$$\sum_i \tau_i = 0 \tag{11.3}$$

where the index $i$ in the sums runs over all the applied forces in the x and
y directions and all torques. 


11.2 The seesaw 


Look at Figure 11.1, which shows a seesaw with a kid of mass $m_1$ sitting
to the left of the supporting pivot P and similarly a kid numbered 2 sitting
to the right. Let these kids have coordinates $x_1$ and $x_2$ with respect to the
origin P. First we write down all the forces on the seesaw. We see $\mathbf{F}_1 = m_1\mathbf{g}$,
$\mathbf{F}_2 = m_2\mathbf{g}$ (where $\mathbf{g} = -9.8\,\mathbf{j}\ \mathrm{m/s^2}$) due to the weight of the kids. Of course,
that cannot be the whole story, because the seesaw is not falling to the 
ground. There is the pivot P, which exerts an upward normal force N. 
The force equations are 


$$0 = 0 \quad \text{along the x direction} \tag{11.4}$$
$$N + F_1 + F_2 = 0 \quad \text{along the y direction,} \tag{11.5}$$


where $F_1$ and $F_2$ are the y-components of $\mathbf{F}_1$ and $\mathbf{F}_2$. Assume that $F_1$ is
known and that we are trying to solve for $F_2$. Clearly we cannot solve for
$F_2$ and $N$ using just one useful equation. This is where the torque equation
comes in. 

Now, if a body is rotating about an axis, we are used to equating the 
torque around that axis to the rate of change of angular momentum about 
that axis or point. But what if it is not rotating? About what point should 
we compute the torque? You can say it’s not rotating through one axis, 




Figure 11.1 Two kids on a seesaw. The supporting pivot provides a force N 
upward. The total torque due to the kids’ weights $F_1 = m_1 g$, $F_2 = m_2 g$ and the
pivot force N is zero.

compute the total torque with respect to that axis, and equate it to zero
to get the third equation. But then someone else can say it is not rotating 
about some other point and get yet another equation. So we can write 
down an infinite number of equations, one for each choice of pivot point 
or axis. But we have only two unknowns, $F_2$ and $N$, and they can only
satisfy two equations. So, we have to hope that the extra equations we get 
by varying the pivot point all say the same thing. And that’s what I will 
show you. 

First consider just this rod in equilibrium. It has forces $\mathbf{F}_1, \ldots, \mathbf{F}_N$
acting on it, and these add up to zero. They act at positions $x_1, \ldots, x_N$,
measured from some point on the rod I have chosen for torque computation.
That it is in equilibrium according to me implies

$$\sum_i \mathbf{F}_i = 0 \tag{11.6}$$
$$\tau = \sum_i F_{i\perp}\,x_i = 0. \tag{11.7}$$

If you choose a different point along the rod for torque computation, you
will replace every $x$ by $x - d$ if your axis is to the right of mine by $d$. The
torque you get will be

$$\tau' = \sum_i F_{i\perp}(x_i - d) = \tau - d\sum_i F_{i\perp} = 0 + 0, \tag{11.8}$$

where the first zero is there because it is just the total $\tau$ according to me,
and the second zero arises because the total force $\mathbf{F}_T = \sum_i \mathbf{F}_i$ is zero and
so is its perpendicular component $F_{T\perp}$.
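
The argument of Eqn. 11.8 is easy to test numerically. Below is a minimal sketch; the function name `net_torque` and the sample forces and positions are assumptions chosen so that the forces balance.

```python
def net_torque(forces, positions, pivot=0.0):
    """Sum of F_perp * (x - pivot) for forces acting perpendicular to a rod."""
    return sum(F * (x - pivot) for F, x in zip(forces, positions))

# Hypothetical balanced case: two weights and an upward support force at x = 0.
forces = [-100.0, 180.0, -80.0]
positions = [-4.0, 0.0, 5.0]

for d in (-2.0, 0.0, 3.0, 10.0):
    print("pivot at", d, "-> torque", net_torque(forces, positions, d))

# If the support is removed, the forces no longer sum to zero and the
# torque depends on the choice of pivot.
unbalanced = [-100.0, 0.0, -80.0]
print(net_torque(unbalanced, positions, 0.0), net_torque(unbalanced, positions, 3.0))
```

The loop prints zero for every pivot because the total force vanishes; the last line shows the pivot dependence that appears once it does not.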

The proof gets tricky if the rotation axis is chosen off the rod or off 
the x-axis. We will address it later. 

Anyway, when you compute the torque, you may pick any point you 
like provided the total force is zero. However, some choices are better suited 
than others for some purposes. For example, if you were asked simply to 
find $F_2$ but not the normal force $N$, there’s a particular choice of pivot
point that is optimal. That choice is the one in which this normal force 
doesn’t get to enter the torque equation and corresponds to computing 
the torque about the pivot point P, where the normal force N acts. Now 
$N$ will drop out of the torque equation, because $r_\perp = 0$. In general, always
compute the torque relative to a point where an unknown force is
acting, because then the unknown force cannot contribute to the torque
equation. 
Following that rule we get 


$$F_1x_1 + F_2x_2 = 0 \tag{11.9}$$


where $x_1$ is negative and $x_2$ is positive. Because $F_1$ and $F_2$ are negative, $F_1x_1$
will be positive (counterclockwise) and $F_2x_2$ will be negative (clockwise).
We can readily solve Eqn. 11.9 for $F_2$. Once we have it, we can go back
to Eqn. 11.5 and find the normal force $N$. Suppose $F_1 = -100$ Newtons,
$x_1 = -4$ m, and $x_2 = 5$ m. Then $F_2 = -80$ Newtons, and it follows from
Eqn. 11.5 that $N = 180$ Newtons.
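
The two-step solution (torque equation first, then the force equation) can be written as a short function. This is a sketch; the name `solve_seesaw` is an assumption, while the numbers are the ones quoted above.

```python
def solve_seesaw(F1, x1, x2):
    """Solve Eqn. 11.9 for F2, then Eqn. 11.5 for the normal force N."""
    F2 = -F1 * x1 / x2        # torque about the pivot P vanishes
    N = -(F1 + F2)            # vertical forces sum to zero
    return F2, N

F2, N = solve_seesaw(F1=-100.0, x1=-4.0, x2=5.0)
print(F2, N)   # -80.0 180.0
```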


11.3 A hanging sign 


Now, Figure 11.2 depicts a more complicated problem. There is a rod of 
length L and mass m, supported by some kind of pivot on the wall at one 
end and a wire anchored to the wall at the other end. The rod will even- 
tually support a sign, which was not ready in time for this problem. My 
concern is the tension on the wire, which I do not want to exceed some 




Figure 11.2 A rod of length L supported by a wire with some tension T and a 
pivot on the wall that applies a force P. 

limit. The straightforward way to find T is to write the free-body diagram
for the rod and demand that the net force and torque vanish:

$$P_x - T\cos\theta = 0 \tag{11.10}$$
$$T\sin\theta + P_y - mg = 0 \tag{11.11}$$
$$\tau_P + \tau_w + \tau_m = 0 \tag{11.12}$$


where the three torques are due to the pivot, the tension on the wire, and 
the weight mg about any point. 

We have three equations for three unknowns: $P_x$, $P_y$, $T$. But if all I
want is the tension T, I can get it from just the torque equation if I choose 
as the point for its computation the pivot itself. Then P drops out and we 
find 


$$T\sin\theta \cdot L - mg \cdot \frac{L}{2} = 0 \quad \text{with a solution} \tag{11.13}$$
$$T = \frac{mg}{2\sin\theta}. \tag{11.14}$$


Observe that I am following the standard convention that a counterclock- 
wise torque is positive. 

In a practical problem, you may have a rod of weight mg = 2000 N 
and a wire that can only withstand a tension $T_{\max}$. Then the computed T
must obey

$$T = \frac{mg}{2\sin\theta} < T_{\max} \tag{11.15}$$
$$\sin\theta > \frac{mg}{2T_{\max}}. \tag{11.16}$$

If $T_{\max} = 4000$ N, this gives us

$$\sin\theta > \frac{1}{4}. \tag{11.17}$$




Now, if I am curious, I can go back and find out $P_x$ and $P_y$ from the
force equations:

$$P_x = T\cos\theta = \frac{mg\cos\theta}{2\sin\theta} \tag{11.18}$$
$$P_y = mg - T\sin\theta = \frac{mg}{2}. \tag{11.19}$$

Interestingly, the wire and the pivot each support half the weight of the rod
no matter what $\theta$ is.
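
Here is a minimal Python sketch of Eqns. 11.14 to 11.19. The function names are assumptions; the weight of 2000 N and the limit of 4000 N are the values used above, while the 30° angle is just a sample.

```python
import math

def wire_tension(mg, theta):
    """Eqn. 11.14: tension for a rod of weight mg, wire at angle theta (radians)."""
    return mg / (2.0 * math.sin(theta))

def pivot_force(mg, theta):
    """Eqns. 11.18 and 11.19: components (P_x, P_y) of the pivot force."""
    T = wire_tension(mg, theta)
    return T * math.cos(theta), mg - T * math.sin(theta)

def min_angle(mg, T_max):
    """Smallest angle (radians) allowed by Eqn. 11.16."""
    return math.asin(mg / (2.0 * T_max))

print(math.degrees(min_angle(2000.0, 4000.0)))      # angle whose sine is 1/4
print(pivot_force(2000.0, math.radians(30.0)))      # P_y is half the weight
```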

Notice that I computed the torque due to gravity as if all the mass 
was concentrated at the CM. This is true but not obvious, because the def- 
inition of the CM as the weighted sum of the masses in the body does not 
necessarily mean the CM can also stand in for the whole body in computing
$\tau$. You should always go back to first principles and check. The only
thing we know is F = ma and the force of gravity. Let us next imagine
dividing the rod into tiny pieces, each of which is small enough to say it has
a definite location $x$ measured from the pivot and a mass $\frac{m}{L}dx$, exactly as
in the CM calculation. The total torque about the pivot is then the integral

$$\tau = \int_0^L g\,x\,\frac{m}{L}\,dx = g\int_0^L x\,\frac{m}{L}\,dx = gmX. \tag{11.20}$$

Except for the factor $g$, the second integral is familiar from the CM calculation
as $mX$, where $X = \frac{L}{2}$ is the CM. There the $x$ entered as the coordinate
to be weighted; here it is the $r_\perp$ for torque calculation. This $\tau = mgX$ is
indeed what we would find if we lumped all the mass at the CM. This result
is valid even if the mass density $\rho(x)$ is not a constant; the integral would
still be $mX$.
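
The last remark, that the result survives a non-uniform density, can be checked by doing the integral numerically. The sketch below uses a simple midpoint rule; the function name and the sample density $\rho(x) = 1 + 2x$ are assumptions for illustration.

```python
def gravity_torque(density, L, g=9.8, n=100000):
    """Midpoint-rule estimate of the torque integral, the total mass, and the CM."""
    dx = L / n
    mass = 0.0
    moment = 0.0
    for k in range(n):
        x = (k + 0.5) * dx           # center of the k-th slice
        dm = density(x) * dx         # mass of the slice
        mass += dm
        moment += x * dm             # same weighting as in the CM calculation
    return g * moment, mass, moment / mass

# Hypothetical rod of length 3 m with density rho(x) = 1 + 2x (kg/m).
tau, m, X = gravity_torque(lambda x: 1.0 + 2.0 * x, 3.0)
print(tau, 9.8 * m * X)
```

The two printed numbers agree because the torque integral and the CM integral weight each slice by the same factor $x$.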


11.4 The leaning ladder 


The last of the equilibrium problems is the one I mentioned in the begin- 
ning of this chapter. I have a ladder of mass m and length L leaning on a 
wall at some angle $\theta$, as shown in Figure 11.3. The wall is frictionless and
the floor has a static coefficient of friction $\mu_s$, which you may assume is
given to you. I have not climbed the ladder yet; I just want to rest it against
the wall and make sure it does not slide down. Can you see intuitively that
$\theta$ has to exceed some lower limit for the ladder not to slide? Let’s just write
down all the equations for equilibrium to find this limit.

Figure 11.3 A ladder of mass m and length L is leaning against a frictionless
wall and standing on a floor with some coefficient of friction $\mu_s$. The normal and
frictional forces due to the floor are shown separately. 


We know mg can be assumed to be acting at the center of the rod. 
The frictionless wall can only exert a force perpendicular to itself, called 
W for wall. The floor is exerting a normal force N and a frictional force f 
directed inward to balance W. That’s it. The force equations along x and 
y are: 


$$f - W = 0 \tag{11.21}$$
$$N - mg = 0. \tag{11.22}$$


All that is left is the torque, and that’s where we have the choice. We can 
take the torque around any point we like. If we want to punish ourselves, 
we can take a torque around some crazy point and then N and f will all 
come into the problem, and we will have to work that much harder. The 
trick, again, is to take the torque where the ladder touches the floor. Then 
N and f are gone, and what do we have? We have mg trying to rotate it 
clockwise and W trying to rotate counterclockwise. Let us use the first of 
the two definitions of the torque 


$$\tau = Fr_\perp = F_\perp r. \tag{11.23}$$

For the weight $r_\perp = \frac{L}{2}\cos\theta$ and for the wall it is $L\sin\theta$. So we have

$$mg\,\frac{L}{2}\cos\theta = WL\sin\theta \quad \text{with the solution} \tag{11.24}$$
$$W = \frac{mg}{2}\cot\theta. \tag{11.25}$$


But now we have a restriction: the frictional force must obey $f \le \mu_s N$.
Consider the two equations:

$$f \le \mu_s N = \mu_s mg \tag{11.26}$$
$$f = W = \frac{mg}{2}\cot\theta, \tag{11.27}$$

which lead to the inequality

$$\frac{mg}{2}\cot\theta \le \mu_s mg \tag{11.28}$$
$$\tan\theta \ge \frac{1}{2\mu_s}. \tag{11.29}$$

That means $\theta$ has to be bigger than some number, because $\tan\theta$ always
increases with $\theta$ for the angles under consideration. For example, if
$\mu_s = 0.5$, we need $\theta \ge 45^\circ = \frac{\pi}{4}$. If I want to climb up the ladder, I need to
add my weight to the force and torque equations. The torque will depend 
on where I am on the ladder, and I will need to make sure I can climb all 
the way up. 
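
For the unloaded ladder, the condition of Eqn. 11.29 can be sketched as follows. The function names and the default mass are assumptions; the case with a climber on the ladder is not covered here.

```python
import math

def min_ladder_angle(mu_s):
    """Smallest angle with the floor (radians) allowed by Eqn. 11.29."""
    return math.atan(1.0 / (2.0 * mu_s))

def ladder_holds(theta, mu_s, m=10.0, g=9.8):
    """Check f = W <= mu_s * N using Eqns. 11.22, 11.25, and 11.26."""
    W = 0.5 * m * g / math.tan(theta)   # wall force, equal to the friction needed
    N = m * g                           # normal force from the floor
    return W <= mu_s * N

print(math.degrees(min_ladder_angle(0.5)))
print(ladder_holds(math.radians(50.0), 0.5), ladder_holds(math.radians(40.0), 0.5))
```

The mass drops out of the comparison, which is why the limiting angle depends only on $\mu_s$.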


11.5 Rigid-body dynamics in 3d 


Rigid-body dynamics is quite easy as long as everything lies in the plane: 
all the mass is concentrated on the plane, all the forces are in the plane; 
all the vectors separating the point of application of the force to the point 
of rotation are in the plane. As we look down the plane, the only possible 
rotation is clockwise (negative) or counterclockwise (positive) about an 
axis perpendicular to the plane. The torque is $\tau = F_\perp r = r_\perp F$ where $r$ is
the distance to the axis from the point of application of the force. 

That was then. But now we have to deal with the fact that in real life
objects like tops and potatoes are not just planar and move in three dimen- 
sions. This is one case where not all the essential ideas can be conveyed in 
anything fewer than three dimensions. 

So we will go to d = 3 but stay with a single point mass for a while. 
More complicated bodies can be built out of point masses. What is the 
analog of $\tau = I\alpha$ when the mass is running around in d = 3? How do you
define torque and angular momentum in d = 3? 

Suppose I have a point mass m in d = 3 at a location r with respect to 
some origin, as shown in the left half of Figure 11.4. Let F be the force on 
it. As in d = 2, we want to combine the two vectors r and F and get a torque 
out of it. We want the magnitude of the torque to be $rF\sin\gamma$ as in d = 2
because the two vectors still lie in one plane, which could have been our old 
d = 2 plane. But now we need to specify the orientation of this common 
plane in d = 3 to specify the direction of the torque. A natural choice is the 
normal to the plane. But there are two possible orientations of the normal! 
We break the tie by defining the torque to point in the direction in which 
a screw will advance if turned from r to F. This completely defines the 
torque vector $\boldsymbol{\tau}$. What we have arrived at is called the cross product of r and
F, written as

$$\boldsymbol{\tau} = \mathbf{r} \times \mathbf{F}. \tag{11.30}$$

In general $\mathbf{C}$, the cross product of two vectors $\mathbf{A}$ and $\mathbf{B}$,

$$\mathbf{C} = \mathbf{A} \times \mathbf{B}, \tag{11.31}$$

points in the direction in which a screw would advance if you turned it
from A to B, and it has a magnitude

$$C = AB\sin\gamma, \tag{11.32}$$

where $\gamma$ is the angle measured from A to B in their common plane, as
shown in the right half of Figure 11.4. 

This rule for fixing the direction of the cross product is called the 
right-hand rule for the following reason. If you grab the z-axis with your 
right hand and your fingers curl around it from A to B as shown by the 
big arrow, the torque points along the thumb, or the positive z-axis. This is 


Figure 11.4 Left: The unit vectors i, j, and k shown for later use. We consider
the torque $\boldsymbol{\tau}$ due to force F with respect to a point (the origin) from which it is
separated by r. The vectors F and (the continuation of) r (dotted line) define the
shaded plane in which angle $\gamma$ is measured from r to F. The magnitude of $\boldsymbol{\tau}$ is
assigned as in d = 2: $\tau = Fr\sin\gamma$. The vector $\boldsymbol{\tau} = \mathbf{r} \times \mathbf{F}$, called the cross product
of r and F, is oriented perpendicular to the plane in the sense dictated by the
right-hand rule. Right: The cross product of A and B (whose common plane is
chosen to be the x-y plane for convenience) points in the direction a typical
screw would advance if turned from A to B, as indicated by the curved arrow
wrapping around the z-axis. It has a magnitude $AB\sin\gamma$, where $\gamma$ is the angle
measured from A to B. Instead of the screw rule, one can invoke the right-hand
rule: if the fingers of the right hand curl from A to B, the thumb points along the
cross product C.


why we homo sapiens can teach our children the cross product and lower 
primates can’t. 

Let us briefly digress to gain familiarity with the cross product. 

Whereas the dot product of two vectors that yields a scalar is defined 
in any number of dimensions, the cross product of two vectors that yields a 
vector makes sense only in d = 3. This trick of using two vectors to define 
a unique direction perpendicular to the plane they define works only in 
d = 3. In d = 4, there will be two independent directions perpendicular to 
the plane defined by any two vectors. 

Back to the cross product. If we reverse the order of the factors in the
cross product it changes sign: 


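$$\mathbf{B} \times \mathbf{A} = -\mathbf{A} \times \mathbf{B} \tag{11.33}$$


because the screw will advance the opposite way in $\mathbf{B} \times \mathbf{A}$, or equivalently,
$\gamma$ will change sign. If we set A = B, then we find


$$\mathbf{A} \times \mathbf{A} = -\mathbf{A} \times \mathbf{A} = 0 \tag{11.34}$$


in agreement with the result $C = AB\sin\gamma$, which also yields 0 when $\gamma = 0$.
Indeed, the cross product vanishes even if B is any scalar multiple of A: the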
cross product of two parallel vectors is zero. This is good because the parallel 
vectors do not define a plane. 

Let us compute the cross product for the basis vectors depicted in the 
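left half of Figure 11.4. Consider $\mathbf{i} \times \mathbf{j}$. It points along k by the right-hand
rule and has unit length because $|\mathbf{i}| \cdot |\mathbf{j}| \sin\frac{\pi}{2} = 1$. Here is a table of the nine
possible cross products of the unit vectors:


$$\mathbf{i} \times \mathbf{i} = \mathbf{j} \times \mathbf{j} = \mathbf{k} \times \mathbf{k} = 0 \tag{11.35}$$
$$\mathbf{i} \times \mathbf{j} = \mathbf{k} = -\mathbf{j} \times \mathbf{i} \tag{11.36}$$
$$\mathbf{j} \times \mathbf{k} = \mathbf{i} = -\mathbf{k} \times \mathbf{j} \tag{11.37}$$
$$\mathbf{k} \times \mathbf{i} = \mathbf{j} = -\mathbf{i} \times \mathbf{k}. \tag{11.38}$$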


Recall that we wrote the dot product in two equivalent ways: 


$$\mathbf{A} \cdot \mathbf{B} = AB\cos\theta = A_xB_x + A_yB_y + A_zB_z. \tag{11.39}$$


We now want to write the cross product in terms of components instead 
of lengths and angles. We can guess the formula for $\mathbf{A} \times \mathbf{B}$ in terms of
components by placing them both in the x-y plane or choosing the x-y
plane so the two of them lie on it. If the vectors make angles $\gamma_A$ and $\gamma_B$
with the x-axis as shown in Figure 11.4, the cross product C has only a
z-component given by


$$C_z = AB\sin(\gamma_B - \gamma_A) \tag{11.40}$$
$$= AB(\sin\gamma_B\cos\gamma_A - \cos\gamma_B\sin\gamma_A) \tag{11.41}$$
$$= A\cos\gamma_A\,B\sin\gamma_B - A\sin\gamma_A\,B\cos\gamma_B \tag{11.42}$$
$$= A_xB_y - A_yB_x. \tag{11.43}$$


It is not hard to guess and to verify that if the vectors had been in
some arbitrary plane so that C had all three components, the complete
answer would have been


$$C_x = A_yB_z - A_zB_y \tag{11.44}$$
$$C_y = A_zB_x - A_xB_z \tag{11.45}$$
$$C_z = A_xB_y - A_yB_x. \tag{11.46}$$


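Once you have one component, say $C_z$, you get the next by making the
cyclic change $x \to y$, $y \to z$, $z \to x$ everywhere.
In this form it is clear that


$$(\mathbf{A} + \mathbf{B}) \times \mathbf{C} = \mathbf{A} \times \mathbf{C} + \mathbf{B} \times \mathbf{C}. \tag{11.47}$$


For example, if $\mathbf{F} = (\mathbf{A} + \mathbf{B}) \times \mathbf{C}$ then for the x-component, following
Eqn. 11.44,


$$F_x = (A_y + B_y)C_z - (A_z + B_z)C_y \tag{11.48}$$
$$= A_yC_z + B_yC_z - A_zC_y - B_zC_y \tag{11.49}$$
$$= (A_yC_z - A_zC_y) + (B_yC_z - B_zC_y), \quad \text{which means} \tag{11.50}$$

$$\mathbf{F} = \mathbf{A} \times \mathbf{C} + \mathbf{B} \times \mathbf{C}. \tag{11.51}$$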


Thus the cross product is distributive: the brackets can be opened out 
as with ordinary products of numbers. The reverse is also true: like terms 
can be grouped into brackets. 
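
The component formulas (Eqns. 11.44 to 11.46) translate directly into code. The sketch below represents vectors as plain 3-tuples; the helper names `cross` and `add` and the test vectors are assumptions made for illustration.

```python
def cross(A, B):
    """Cross product from Eqns. 11.44-11.46, for vectors given as (x, y, z)."""
    Ax, Ay, Az = A
    Bx, By, Bz = B
    return (Ay * Bz - Az * By,
            Az * Bx - Ax * Bz,
            Ax * By - Ay * Bx)

def add(A, B):
    """Component-wise sum of two vectors."""
    return tuple(a + b for a, b in zip(A, B))

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
print(cross(i, j), cross(j, k), cross(k, i))   # k, i, j as in Eqns. 11.36-11.38

# Antisymmetry (Eqn. 11.33) and distributivity (Eqn. 11.47) on sample vectors.
A, B, C = (1, 2, 3), (-4, 0, 5), (2, -1, 7)
print(cross(B, A) == tuple(-c for c in cross(A, B)))
print(cross(add(A, B), C) == add(cross(A, C), cross(B, C)))
```

Integer components are used so that the equality checks are exact; with floating-point components a tolerance would be needed.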

Now back to torque. Here are two applications of Eqn. 11.47. 

First we will show that if the total force F on a body vanishes, and
the total torque $\boldsymbol{\tau}$ is zero about one point, it will be zero about any other.
Suppose relative to one point we have


$$\boldsymbol{\tau} = \sum_i \mathbf{r}_i \times \mathbf{F}_i = 0. \tag{11.52}$$


(Notice that I have used 0 instead of the zero vector $\mathbf{0}$, as is commonly done.) Switching to
another point located at $\mathbf{r}_0$ changes every $\mathbf{r}_i$ into $\mathbf{r}_i - \mathbf{r}_0$:


$$\boldsymbol{\tau}' = \sum_i (\mathbf{r}_i - \mathbf{r}_0) \times \mathbf{F}_i = \boldsymbol{\tau} - \mathbf{r}_0 \times \sum_i \mathbf{F}_i = 0 + 0 \tag{11.53}$$


where we have pulled out $\mathbf{r}_0$ from the sum over $i$ using the distributive
nature of the cross product. 

Next consider the torque on a body in a uniform gravitational field g, 
which means simply that every mass m will experience a force mg. Near the 
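earth, $\mathbf{g} = -9.8\,\mathbf{k}\ \mathrm{m\,s^{-2}}$, if the vertical direction is along the z-axis. Assume
the body is made of countable masses $m_i$ at positions $\mathbf{r}_i$. The torque is


$$\boldsymbol{\tau} = \sum_i \mathbf{r}_i \times m_i\mathbf{g} \tag{11.54}$$
$$= \Big(\sum_i m_i\mathbf{r}_i\Big) \times \mathbf{g} \tag{11.55}$$
$$= M\mathbf{R} \times \mathbf{g} \tag{11.56}$$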


where R is the CM. Thus the torque behaves as if all the mass M were 
concentrated at the CM position R. 
Now for the definition of angular momentum. Unlike 


$$\tau = I\alpha = I\frac{d\omega}{dt} = \frac{dL}{dt} \tag{11.57}$$

in the planar case, we will now have a vector equation:

$$\boldsymbol{\tau} = \frac{d\mathbf{L}}{dt}. \tag{11.58}$$

Evidently the angular momentum L has to be a vector because $\boldsymbol{\tau}$ is a vector.
In the plane we saw that

$$L = I\omega = mr^2\omega = m(r\omega)r = mvr \tag{11.59}$$


for an object that is part of a rigid body and hence forced to go around 
in a circle of radius r, with its velocity vector always perpendicular to the 
position vector. What if the object in question is not part of a rigid body? 
Then it could have a velocity that is not necessarily perpendicular to the 
position vector. The rigid body case suggests we keep only the part of its 
velocity perpendicular to r in the magnitude of L: 


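$$L = mv_\perp r = mvr\sin\gamma \tag{11.60}$$


where $\gamma$ is the angle between v and r. The $\sin\gamma$ gives us the clue for the
formula for the vector L in d = 3:


$$\mathbf{L} = \mathbf{r} \times \mathbf{p}. \tag{11.61}$$


This cross product would also arise naturally if we were guided by mathematics
and sought a way to combine r and p to form a vector. Of course
(−19) times this answer would also be a cross product, but the above-mentioned
choice of L has the essential feature that its time derivative is
the torque $\boldsymbol{\tau}$:


$$\frac{d\mathbf{L}}{dt} = \frac{d}{dt}(\mathbf{r} \times \mathbf{p}) \tag{11.62}$$
$$= \frac{d\mathbf{r}}{dt} \times \mathbf{p} + \mathbf{r} \times \frac{d\mathbf{p}}{dt} \tag{11.63}$$
$$= \frac{\mathbf{p}}{m} \times \mathbf{p} + \boldsymbol{\tau} \tag{11.64}$$
$$= 0 + \boldsymbol{\tau}. \tag{11.65}$$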

These steps call for some explanation. By working with components you
may satisfy yourself that the derivative acts on the cross product r x p just 
as it does on the ordinary product of two functions, provided we keep the 
order of the two factors r and p the same. Next I have used the fact that 
because the velocity and momentum are parallel (related by m), their cross 
product vanishes. The bottom line is that 


$$\boldsymbol{\tau} = \frac{d\mathbf{L}}{dt}. \tag{11.66}$$

In the case of a planar body that is made up of many point masses
in the plane indexed by $i$, each one of them has its own angular momentum
$\mathbf{L}_i$, each of which must be pointing out of the plane or into the plane:
because $\mathbf{r}_i$ and $\mathbf{p}_i$ both lie in the plane, their cross product points perpendicular
to that plane. That is why both torque and angular momentum
were treated as scalars and not as vectors. Once a vector is constrained to 
lie up or down one axis, we can forget about the fact that it is a vector. We 
just say it’s plus if it’s up and minus if it’s down. 

In the case of a point mass that belonged to a rigid body, angular 
momentum had to do with actual rotations around some axis. But the 
definition $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ holds for any particle with linear momentum, even
if it is not going around in a circle, even if it is going in a straight line. 
Is this idea consistent with the older one? Consider Figure 11.5. Suppose 
a piece from the edge of a counterclockwise-rotating disk of radius r flies 
off from the 3 o’clock position and moves linearly along the tangential or 
y-direction. Before the fragmentation it had 


$$L_1 = mv_1r_1 = p_1r_1 = p_1r_{1\perp}. \tag{11.67}$$


We believe it will maintain this L because there are no forces or torques
acting on it. Indeed it does: $p_1 = p_2$, because there are no forces, and even
though $r_2 > r_1$, $r_{2\perp} = r_{1\perp} = r$, the radius of the disk.

So, every moving particle will have an angular momentum about any 
given point, unless the line of its momentum vector goes through that 
point. Thus, even if the particle had never been part of the disk but simply 
traveled along the dotted line with a momentum $\mathbf{p} = \mathbf{p}_1 = \mathbf{p}_2$, it would
have had the constant angular momentum $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ with respect to the
origin. 
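
That a free particle moving in a straight line keeps a constant $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ can be checked directly. The sketch below is a minimal illustration; the function names, the starting point, and the momentum are assumptions chosen to mimic the fragment leaving the 3 o’clock position.

```python
def cross(A, B):
    """Cross product of two 3-component vectors."""
    Ax, Ay, Az = A
    Bx, By, Bz = B
    return (Ay * Bz - Az * By, Az * Bx - Ax * Bz, Ax * By - Ay * Bx)

def angular_momentum_along_path(r0, p, m, times):
    """L = r x p at each time for a free particle with r(t) = r0 + (p/m) t."""
    result = []
    for t in times:
        r = tuple(x + (pc / m) * t for x, pc in zip(r0, p))
        result.append(cross(r, p))
    return result

# Hypothetical fragment: starts at x = 2 (disk radius), moves along +y.
Ls = angular_momentum_along_path(r0=(2.0, 0.0, 0.0), p=(0.0, 3.0, 0.0),
                                 m=1.5, times=[0.0, 1.0, 5.0, 20.0])
for L in Ls:
    print(L)
```

Every entry has the same z-component, the momentum times the perpendicular distance, even though the distance from the origin keeps growing.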


Figure 11.5 Angular momentum is defined by $\mathbf{L} = \mathbf{r} \times \mathbf{p}$ whether or not the
particle is actually going around a point. The figure shows a point at the 3 o’clock
position at the rim of a rotating disk. It has an angular momentum $\mathbf{L}_1 = \mathbf{r}_1 \times \mathbf{p}_1$.
Now it flies off along the tangent to the position $\mathbf{r}_2$ and has the angular momentum
$\mathbf{L}_2 = \mathbf{r}_2 \times \mathbf{p}_2$. Because there are no forces or torques on it when it flies off, we
expect that $\mathbf{L}_1 = \mathbf{L}_2$. This is true: even though $r_2 > r_1$ in magnitude, the
component perpendicular to the momentum is unchanged and equal to the
radius r of the disk, while $\mathbf{p}_1 = \mathbf{p}_2$ for a free particle.


Consider now a collection of bodies. Each one obeys 


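$$\frac{d\mathbf{L}_i}{dt} = \boldsymbol{\tau}_i = \boldsymbol{\tau}_{i,e} + \sum_j \boldsymbol{\tau}_{ij} \tag{11.68}$$


where $\boldsymbol{\tau}_{i,e}$ is the torque on $i$ due to external forces and $\boldsymbol{\tau}_{ij}$ the torque on it
due to force $\mathbf{F}_{ij}$ exerted on it by particle $j$. If we sum the terms over $i$, we
get for the total angular momentum


$$\frac{d\mathbf{L}}{dt} = \boldsymbol{\tau}_e + \sum_{i,j} \mathbf{r}_i \times \mathbf{F}_{ij}. \tag{11.69}$$

Suppose the total external torque $\boldsymbol{\tau}_e = 0$. Then is $\frac{d\mathbf{L}}{dt} = 0$? We can still
use $\mathbf{F}_{ij} = -\mathbf{F}_{ji}$ but we have different factors of $\mathbf{r}_i$ and $\mathbf{r}_j$ that multiply the
forces. Let us first consider just two particles $i = 1$ and $j = 2$ and see why
L is conserved. (The argument applies unchanged with more particles.)
Consider the internal torques and use $\mathbf{F}_{21} = -\mathbf{F}_{12}$:


$$\mathbf{r}_1 \times \mathbf{F}_{12} + \mathbf{r}_2 \times \mathbf{F}_{21} = (\mathbf{r}_1 - \mathbf{r}_2) \times \mathbf{F}_{12}. \tag{11.70}$$


It will vanish provided the cross product vanishes and that in turn is assured
if $\mathbf{r}_1 - \mathbf{r}_2$ and $\mathbf{F}_{12}$ are parallel, that is, the force between the bodies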
points along the vector separating them. This is true for the case of gravita- 
tion, as we have seen, and true for the electrostatic force as well. One could 
argue on philosophical grounds that if i and j were the only two particles 
in the universe, the only possible direction for their mutual force to point 
along is their separation vector, unless the universe has some other intrinsi- 
cally preferred direction. We believe it does not, and the conservation of L 
follows from this isotropy of space. Of course, once there are other bodies 
there is no reason the mutual force between any two could not be affected 
by the others. But it seems that is not the case, at least in classical physics. 
In other words, once we find how a pair of particles interact in isolation, 
we do not need to modify that interaction in the presence of additional 
particles: each pair continues to interact as before. This is another lucky 
break for us. 


11.6 The gyroscope 


The gyro is one problem where the math rather than your intuition will 
serve you better. Here is the gyro in Figure 11.6. Focus on the side view 
of the gyro. The gyro is a massless rod, supported by the tower at one end 
and attached to a cylinder of mass m at the other. The cylinder is free to 
spin about its axis, but first assume that it is not spinning and that I am 
propping up the cylinder with my finger. Now I let go. Everybody should 
know what follows: the rod and gyro will start rotating clockwise in the 
vertical plane about an axis perpendicular to the page. You don’t need to 
take Physics 200 to know that. Let us make sure that agrees with $\boldsymbol{\tau} = \frac{d\mathbf{L}}{dt}$.
The torque $\boldsymbol{\tau} = \mathbf{r} \times m\mathbf{g}$ has a magnitude $\tau = mgl$ and is directed into the
page by the right-hand rule. We should be very clear that when angular 
momentum points into the paper, it doesn’t mean the gyro goes into the 

Figure 11.6 Top: The gyro resting on a tower, spinning with angular
momentum L as seen from the side at t = 0. The rotating cylinder has all the mass
and moment of inertia. Gravity exerts a downward force and a corresponding
torque going into the page. Bottom: The view from the top at times $\pm\frac{1}{2}dt$. The
bold arrows indicate the direction of rotation of the cylinder as seen from the top.
The gyro has acquired a $d\mathbf{L} = \boldsymbol{\tau}\,dt$ perpendicular to L. The horizontal vector
$\mathbf{L}(0) = \mathbf{L}$ is not shown to prevent cluttering. The angles are exaggerated in the
figure for clarity. To first order in the infinitesimals $|\mathbf{L} \pm \frac{1}{2}d\mathbf{L}|^2 = L^2$ because
$\mathbf{L} \cdot d\mathbf{L} = 0$. Thus L simply gets rotated by an angle $d\phi = \frac{dL}{L}$ in time dt.


paper. It just means the axis around which the gyro rotates is pointing into 
the paper. 

This is it, if we have a gyro and it does not occur to us to spin the 
cylinder. 

So let us spin it. Now it’s a different ballgame. The gyro has an initial 
angular momentum. Which way is it pointing? That’s the first thing you 
have to understand. Every part of the cylinder is spinning, and the angular 
momentum of every part—if you do the r x p—will point radially out, in 
the direction of the total L as shown. 

Let us look at the impact of the gravitational torque over a time
dt symmetrically chosen around t = 0. The top view shows the gyro at
$t = \pm\tfrac{1}{2}dt$. The angular momentum change $d\mathbf{L} = \boldsymbol{\tau}\,dt$ is going into the paper
in the side view and pointing straight up in the top view. The angular
momenta at $t = -\tfrac{1}{2}dt$, $0$, and $\tfrac{1}{2}dt$ all have the same length to first order in
the infinitesimals:


$$\left|\mathbf{L} \pm \tfrac{1}{2}d\mathbf{L}\right|^2 = L^2 \pm \mathbf{L}\cdot d\mathbf{L} + \tfrac{1}{4}|d\mathbf{L}|^2 = L^2 \qquad (11.71)$$


because $\mathbf{L}\cdot d\mathbf{L} = 0$. As usual we drop the last term $|d\mathbf{L}|^2 \propto dt^2$ because it is
a quadratic infinitesimal.

That’s why the gyro doesn’t fall even though it acquires a dL. For a 
static rod, getting an angular momentum going into the paper really means 
swinging down. But if a rotating gyro has an angular momentum to begin 
with, and we add on a perpendicular change, the new angular momentum 
is just a rotated version of the old angular momentum. So, when seen from 
the top, the gyro will be executing what’s called precession. The gyro itself 
will just slowly go round and round with one end resting on top of the 
tower. 

Does this remind you of the satellite going around the earth versus 
the dropped apple? Both accelerate toward the earth. In the case of the 
apple, the gravitational acceleration produces a downward velocity that 
adds to zero initial downward velocity, and the apple simply picks up speed 
toward the earth. A satellite, on the other hand, has a large velocity in the 
tangential direction to begin with, and adding a tiny radial bit only changes 
its direction as it moves to a different part of its circular orbit a little bit 
later. 

Same thing with the gyro. It gets the same change in angular momen- 
tum dL whether or not the cylinder is spinning. But in one case, you add 
dL to zero and conclude it’s going to swing down; in the other case, you 
add dL to a non-zero L and conclude that L will rotate. 

Let us calculate $\Omega_p$, the frequency of precession. Look at the lower
half of the figure, the view from the top. In a time dt, the dL vector grows
to a length $\tau\,dt$ and L itself rotates by $d\phi$. Applying the formula for the arc
length $ds = r\,d\phi$ to the triangle made of L − dL/2, dL, and L + dL/2,


$$\tau\,dt = L\,d\phi, \quad \text{which means} \qquad (11.72)$$

$$\Omega_p = \frac{d\phi}{dt} = \frac{\tau}{L} = \frac{mgl}{I_{cyl}\,\omega} \qquad (11.73)$$

where $I_{cyl} = \tfrac{1}{2}mR^2$ for the rotating cylinder. (You may worry that dL is a
chord and not an arc, but this difference vanishes when dL → 0.)
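To get a feel for the size of the precession frequency, here is a minimal numerical sketch of Eqn. 11.73. The function name and all the numbers (a 1 kg cylinder of radius 5 cm on a 10 cm rod, spinning at 100 rad/s) are assumptions chosen only for illustration, and the symbol omega stands for the spin angular velocity of the cylinder.

```python
import math

def precession_frequency(m, l, R, omega, g=9.8):
    """Return Omega_p = tau / L = m*g*l / (I_cyl * omega), in rad/s."""
    tau = m * g * l            # magnitude of the gravitational torque about the pivot
    I_cyl = 0.5 * m * R**2     # moment of inertia of a solid cylinder about its axis
    L = I_cyl * omega          # magnitude of the spin angular momentum
    return tau / L

# Hypothetical gyro: values are illustrative, not taken from the text.
omega_p = precession_frequency(m=1.0, l=0.10, R=0.05, omega=100.0)
period = 2 * math.pi / omega_p
print(f"Omega_p = {omega_p:.2f} rad/s, one full turn every {period:.2f} s")

# The mass cancels between the torque and the moment of inertia,
# so a heavier cylinder of the same shape precesses at the same rate.
print(precession_frequency(m=5.0, l=0.10, R=0.05, omega=100.0))
```

Notice that doubling the spin rate halves the precession frequency: the larger L is to begin with, the smaller the angle through which the same dL rotates it.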


CHAPTER 12
Special Relativity I: The Lorentz Transformation


Although the general public associates the theory of relativity with Ein- 
stein’s monumental work of 1905, it is actually a lot older, going back to 
Galileo and Newton. According to the relativity principle, two observers in 
uniform relative motion will deduce the same laws of physics. That view 
of relativity has remained unchanged even after Einstein. However, in the 
Galilean version, the laws considered were those of mechanics, which was 
pretty much everything in those days. In the nineteenth century, it began 
to look as if the laws of electromagnetism and light did not respect the rel- 
ativity principle. Einstein then rescued the principle, but he threw many 
cherished Newtonian ideas of space and time under the bus in the bargain. 
His 1905 work is referred to as the special theory of relativity, in contrast 
to his general theory, which came out in 1915. It was a theory of gravi- 
tation, and it is universally considered one of the greatest feats of human 
imagination and invention. 

We will limit ourselves to special relativity, once again working with 
the minimum number of spatial dimensions, which happens to be just one. 
Time, which was once viewed as an absolute parameter, will turn out to be 
an additional dimension, in a sense to be made precise later.


12.1 Galilean and Newtonian relativity


The standard pedagogical technique for explaining relativity is in terms 
of some high-speed trains. Imagine two such (infinite) trains parked in 
parallel tracks in the station, along the x-axis. You board one train and see 
the other at rest. All the blinds are now closed; you are not to look outside 
yet. You settle down and explore the world around you. You pour yourself 
a drink, you play pool, you juggle some tennis balls, you play with your 
mass-spring system, and so on, and you develop a certain awareness and 
understanding of the mechanical world. Then you go to sleep. While you 
are sleeping, one of two things happens: either your train is left alone or 
some unseen hand gives your train a velocity of 200 miles per hour. The 
question is, when you wake up, can you tell which of the two things has 
happened without looking outside? Will anything inside this train betray 
that velocity? Now, you might say, “I know I am not moving because the 
sign up there says Amtrak.” That is a kind of sociological reason, not a 
physics reason based on experiments. The claim of relativity is that you 
just will not know if you are moving or not. However, if the train picks up 
speed or slows down, you will know right away. If it picks up speed, you 
find yourself pushed against the back of the seat; if the driver slams on the 
brake, you will slam into the seat in front of you. So accelerated motion 
can be detected in a closed train without looking outside. The question is 
whether uniform velocity can produce detectable changes inside the closed 
train. The answer is no, according to relativity. 

There is an equivalent way to say that everything looks the same when 
you wake up: Newton’s laws continue to be valid in the moving train. If 
Newton’s laws are valid, everything mechanical will look the same. Our 
expectations of what happens when, say, two billiard balls collide, or a 
mass and spring system oscillates, are all based on Newton’s laws. So the 
claim is that Newton’s laws will be unchanged when this uniform velocity 
is imparted to you. 

Recall that if Newton’s laws work for you, you are called an inertial
observer, and your frame of reference is called an inertial frame. Not
all observers or frames are inertial. For example, F = ma won't be true 
in an accelerating train: You leave things on the floor and they will slide 
backward. So with no apparent force acting on them, things will begin to 
accelerate. That’s why an accelerating train is a non-inertial frame. We are 
not interested in that situation. So assume you started out in the station 
as an observer for whom the laws of Newton hold. The claim is they will 
hold when you wake up, even if in the meantime you have been given a
uniform velocity. 

Remember the train standing next to yours in the beginning. Suppose 
when you wake up, you open the blinds and see the other train moving at 
200 miles per hour. That there is motion between you and the other train is 
an undeniable, experimental fact. The question is, can you tell if your train 
is responsible for this relative motion? Maybe nothing happened to you 
and the other train is moving the opposite way at 200 miles per hour? The 
relativity principle states that you really cannot tell. You can only tell that 
there is relative motion between the two trains that was not there before, 
but not “who is really moving.” 

This means that if you observe relative motion between you and the 
other inertial observer, you have every right to insist that you are not mov- 
ing and the other observer is moving the opposite way and vice versa. But 
I repeat: You can make this argument only for uniform relative motion. If 
your train is accelerating, you cannot say “I’m not accelerating; the other 
train is accelerating in the other direction,” because you are the one who is 
slamming your head on the seat in front of you, and nothing is happening 
to the other person. Or if you are in a rocket that is taking off and the G 
forces are enormous, you are in danger, not everyone on earth. 

Now, back to the train. Suppose you look outside the window on the 
other side, and you see some cows going 200 miles per hour. Now what? I 
know you are thinking it must be the train traveling at 200 miles per hour. 
That conclusion is again based on non-physics notions. You are forget- 
ting that it is logically possible that somebody put the whole landscape on 
wheels when you went to sleep, making the cows outside the train appear 
to move the opposite way. It is not likely that someone would go to such 
lengths just to fool you, but if they did, you would not know. 

That’s why in these thought experiments we don’t like to open the 
window and look at the landscape, because then we have a bias. We will 
just look at the other train. Then, when you detect relative motion, you 
really cannot tell who is actually moving. This is clearer if, instead of two 
trains, you have two spaceships in outer space in relative motion and no 
other background. 


12.2 Proof of Galilean relativity 


I will now show that the laws of mechanics that you will deduce after wak- 
ing up will be the same, namely Newtonian, even if you have acquired 
a uniform velocity while you slept. This will explain once and for all why
everything mechanical will look the same when you wake up. First we need 
to do some groundwork. 

We begin with the notion of an event. An event is something that 
happens at a certain place at a certain time. For example, if a firecracker 
goes off somewhere at some time, the x is where it happened and the t 
is when it happened. So (x,t) are coordinates in spacetime. Once again 
spacetime does not require Einstein’s involvement at all. Even cave dwellers 
organizing a party knew that you have to say where (next to the dead T-rex) 
and you have to say when (at sunset). The fact that you need x and t, or, if 
you’re living in three spatial dimensions, the fact that you need x, y, z, and
t, is not new. What is new will be clear soon. 

Figure 12.1 shows two frames of reference, with their x-axes aligned. 
All the action is along the x-axis. We draw a perpendicular y-axis for ped- 
agogical reasons, but we rarely deal with the y coordinate. I am in frame S 
and you are in frame S’. 

Actually S is not just me sitting at the origin, x = 0: I am part of a 
huge team of people who are at rest relative to me. I have my agents all 
over the x-axis; they are my eyes and ears. If there’s a firecracker exploding 
to the right, my guys will tell me, even if I am not personally there. So when 
I say I see something, I really mean my buddies and I, all traveling in the 
same train at the same speed, spread all over space, taking notes on what’s 
happening. We may have to pool our information later, but we will know
this explosion took place at (x, t). In short, it takes a village to measure the
spacetime coordinates of events. And you, sitting at your origin x’ = 0, also
have your own village of informers in S’.

Figure 12.1 The same event (solid dot) is assigned coordinates (x, t) by me (S)
and (x’, t’) by you (S’). In the pre-Einstein days t = t’ not only initially (when our
origins crossed and clocks were synchronized) but always. At time t, your origin
is to the right of mine by an amount ut. So the firecracker going off at location x
at time t according to me happens at x’ = x − ut according to you.

Now, imagine you are sliding to the right according to me at a uni- 
form velocity u. You start out to my left; you pass me; and then a little later 
you are somewhere to the right. We arrange it so that when you pass me, 
that is, when our origins cross, you set your clock to zero and I set my clock 
to zero. So here’s an event: You and I crossed. What are the coordinates for 
that event? According to me, that event occurred at my origin x = 0, and 
the time was chosen to be t = 0. According to you, because your origin was 
also on top of my origin, x’ = 0 as well. In Newtonian mechanics there is 
one time for all of us, and it was chosen to be 0 when our origins crossed. 
We made the coordinate of the event, our crossing, (0,0) for both you and 
me. 

Now for a second event: a firecracker going off, shown by a black dot 
in Figure 12.1. I assign to it coordinates (x, t). 

What do I expect you to say? You measure the distance from your 
origin, and you call this point x’; the time is still t’ = t. What’s the relation
of x’ to x? Well, this event took place at time t, so I know that your origin 
is off to the right by u times t. So the distance from your origin for this 
event, I maintain, is x’ = x − ut. In summary we have


$$x' = x - ut \qquad (12.1)$$

$$t' = t. \qquad (12.2)$$


Henceforth we will denote the common time by t. This is called the 
Galilean transformation of coordinates in honor of the man who played 
a major role in articulating the principle of relativity. 

Eqn. 12.1 may be inverted to express my x in terms of your x’: 


$$x = x' + ut. \qquad (12.3)$$


So far x and t have been unrelated, and they could be chosen inde- 
pendently. At any time t, the coordinates of an event x can have any value. 
Let us now imagine that I am using (x, t) to describe a moving particle, say 
a bullet, and that x(t) is its location at time t according to me. Likewise 
x'(t) is its location according to you at the same common time t’ = t. By 
the previous logic, the functions x(t) and x’(t) also differ by ut:

$$x'(t) = x(t) - ut. \qquad (12.4)$$


Let $v = \frac{dx}{dt}$ be its velocity according to me. Then w, the velocity according
to you, follows from differentiating Eqn. 12.1 with respect to t and bearing
in mind that u is a constant:

$$w = \frac{dx'}{dt} = \frac{dx}{dt} - u = v - u. \qquad (12.5)$$

This is how velocities transform in Galilean relativity. This agrees with 
common sense and our day-to-day experience. For example, if I say the 
bullet is going at v = 600 mph, and you are going to the right in your train 
at u = 200 mph, you should measure the bullet speed to be reduced by 
200 mph, and you should find that w = 600 − 200 = 400 mph. Taking one
more derivative of Eqn. 12.1, we find for the acceleration of the moving 
object 

$$a'(t) = \frac{d^2x'}{dt^2} = \frac{d^2x}{dt^2} = a(t) \qquad (12.6)$$


because u is a constant. That means you and I agree on the acceleration of 
the body. We disagree on where the body is. We disagree on how fast the 
body is moving. But we agree on the acceleration of the body, because in 
going from me to you all we do is add a constant to all velocities. Therefore, 
if according to you the velocity of the body is constant, according to me 
the velocity of the body is also a constant, but a different constant. Or, if the 
body has an acceleration, we'll both get the same acceleration. This fact is 
key to determining the fate of Newton’s law when we change frames. 
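
The three statements above (we disagree on position and velocity but agree on acceleration) can be checked numerically. What follows is only a sketch under stated assumptions: the helper names are invented for illustration, the bullet is given a hypothetical constant acceleration so that the last check is not trivial, and the derivatives are estimated by finite differences rather than computed exactly.

```python
def galilean(x, t, u):
    """Map my coordinates (x, t) to yours (x', t') for a frame moving at velocity u."""
    return x - u * t, t  # Eqns. 12.1 and 12.2

def velocity(position, t, h=1e-3):
    """Central-difference estimate of the first derivative of position(t)."""
    return (position(t + h) - position(t - h)) / (2 * h)

def acceleration(position, t, h=1e-3):
    """Central-difference estimate of the second derivative of position(t)."""
    return (position(t + h) - 2 * position(t) + position(t - h)) / h**2

u = 200.0              # your train's velocity according to me
v0, a0 = 600.0, 3.0    # hypothetical bullet: initial velocity and constant acceleration

def x_me(t):
    """The bullet's position according to me."""
    return v0 * t + 0.5 * a0 * t**2

def x_you(t):
    """The same bullet's position according to you, via Eqn. 12.4."""
    x_prime, _ = galilean(x_me(t), t, u)
    return x_prime

v, w = velocity(x_me, 0.0), velocity(x_you, 0.0)
a, a_prime = acceleration(x_me, 0.0), acceleration(x_you, 0.0)
print(f"v = {v:.1f}, w = {w:.1f}, v - u = {v - u:.1f}")
print(f"a = {a:.3f}, a' = {a_prime:.3f}")
```

With the numbers used in the text (v = 600, u = 200), the computed w matches v − u, while the two accelerations coincide because the term ut disappears after two derivatives.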

Imagine that I study, in my inertial frame, two bodies with coordinates
$x_1$ and $x_2$, which exert a force that depends on the distance between
them, say $F(x_1 - x_2) = A/|x_1 - x_2|$, where A is some constant with units
Newton-meters. The masses obey


$$m_1\frac{d^2x_1}{dt^2} = \frac{A}{|x_1 - x_2|} \qquad (12.7)$$

$$m_2\frac{d^2x_2}{dt^2} = -\frac{A}{|x_1 - x_2|}, \qquad (12.8)$$

where $F_2 = -F_1$ in accordance with the third law. We have already seen
that we can replace my accelerations on the left-hand side by your primed
ones. But we can also replace my coordinate differences on the right by
your primed ones because this is implied by $x_1' = x_1 - ut$ and $x_2' = x_2 - ut$:


$$|x_1' - x_2'| = |(x_1 - ut) - (x_2 - ut)| = |x_1 - x_2|. \qquad (12.9)$$


Thus my Eqns. 12.7 and 12.8 imply identical equations for you, in S’: 


$$m_1\frac{d^2x_1'}{dt^2} = \frac{A}{|x_1' - x_2'|} \qquad (12.10)$$

$$m_2\frac{d^2x_2'}{dt^2} = -\frac{A}{|x_1' - x_2'|}. \qquad (12.11)$$


It should be clear that this argument is quite general. You should try to 
apply it to a mass-spring system seen in the two frames, using the fact that 
the extension or compression of the spring is the same for both observers. 

So this is the trick. Newton’s laws work in my inertial frame, S; my 
unprimed coordinates obey them. I can then use the Galilean transforma- 
tion to show that you, in S’, will find that your primed ones will obey them 
as well. 
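
The argument can also be tested by brute force. In the sketch below I solve the two-body equations twice with the very same law of motion: once with my initial conditions in S, and once with your initial conditions in S’ (same positions at t = 0, every velocity reduced by u). If the law really holds in both frames, your answers must equal mine shifted by ut. The integrator (velocity Verlet), the masses, the constant A, and the value of u are all assumptions picked for illustration.

```python
def simulate(x1, x2, v1, v2, m1=1.0, m2=2.0, A=1.0, dt=1e-3, steps=1000):
    """Integrate Eqns. 12.7 and 12.8 with the velocity Verlet method.

    Assumes x1 > x2 throughout, so the force A/|x1 - x2| pushes body 1 to the
    right and body 2 to the left.
    """
    def accelerations(x1, x2):
        force = A / abs(x1 - x2)
        return force / m1, -force / m2

    a1, a2 = accelerations(x1, x2)
    for _ in range(steps):
        x1 += v1 * dt + 0.5 * a1 * dt**2
        x2 += v2 * dt + 0.5 * a2 * dt**2
        new_a1, new_a2 = accelerations(x1, x2)
        v1 += 0.5 * (a1 + new_a1) * dt
        v2 += 0.5 * (a2 + new_a2) * dt
        a1, a2 = new_a1, new_a2
    return x1, x2

u = 5.0            # your velocity relative to me (hypothetical value)
T = 1000 * 1e-3    # total elapsed time: steps * dt

# My description in S: both bodies start at rest.
x1, x2 = simulate(x1=1.0, x2=0.0, v1=0.0, v2=0.0)

# Your description in S': same starting positions at t = 0, velocities shifted by -u.
x1p, x2p = simulate(x1=1.0, x2=0.0, v1=-u, v2=-u)

# Both differences should vanish up to rounding error: x' = x - uT.
print(x1p - (x1 - u * T), x2p - (x2 - u * T))
```

The agreement is no accident of the numbers: the force depends only on the separation, which is the same in both frames, and that is exactly the content of Eqn. 12.9.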

We can also say it differently. If you wake up from your nap in a mov- 
ing train and examine the world around you, you are going to get the same 
Newtonian laws as before. This can be proven by a person on the ground 
using the previous argument with you as the observer S’. 

This is the way one can prove the principle of relativity in Newtonian 
mechanics. 

Finally, suppose that the relative velocity u is not constant. Then the 
accelerations will not agree because $a' = a - \frac{du}{dt}$ and F = ma will fail in the
accelerating frame. The person will see objects accelerating without any 
force acting on them and know she is non-inertial. 


12.3 Enter Einstein 


We now fast forward about three hundred years. By this time André- 
Marie Ampère, Michael Faraday, Carl Gauss, and others had discovered 
the basic equations of electricity and magnetism. James Clerk Maxwell 
condensed these findings into his compact “Maxwell’s equations” and, by 
modifying one of them to achieve mathematical and physical consistency, 
predicted electromagnetic waves. The wave velocity that Maxwell calculated,
$3 \times 10^8$ m/s, was widely known at that time to be the velocity of light,
c. It was then correctly concluded that light must just be an electromagnetic
wave. And the question was, for which observer is this $c = 3 \times 10^8$ m/s
going to be the velocity of light?

Let me elaborate. Later we will calculate the velocity of waves on a 
string clamped at two ends as a function of the tension on the string and 
the mass density. That’s the velocity as seen by a person in the lab, one for 
whom the string is at rest except for its tiny transverse vibrations. Likewise, 
if you calculate the speed of sound in a room, that speed is with respect to 
the air in that room, because sound waves travel in air. In general, the cal- 
culated velocity of a wave will be with respect to the medium that supports 
it. An observer moving relative to the medium will measure a different 
velocity. 

So it was naturally assumed that the speed c calculated by Maxwell 
was for an observer at rest with respect to the ether, the medium that was 
postulated to carry electromagnetic waves, the way air carries sound waves. 
People wanted to know more about this ether. First of all, the ether must 
be everywhere because we can see the sun, and we can see the stars. Then 
you can ask, how dense is the ether? Usually the denser the medium, the 
more rapidly signals travel. For example, sound travels much faster in a 
very dense material like iron than in air. So the ether would have to be 
very, very dense. And yet it allows celestial bodies to move through it for 
years and years without slowing down. It’s a very peculiar medium. And 
the question is, how fast are we moving relative to this medium? 

In 1887 Albert Michelson and Edward Morley did the experiment to 
find out. Suppose their lab was moving at a speed v relative to the ether at the
instant of measurement. They should find the speed of light to be c − v.
They got a speed of exactly c! Perhaps at the very moment Michelson and 
Morley performed their experiment, their lab just happened to be at rest 
with respect to the omnipresent ether. Fine. But we know that this has to 
change in 12 hours, when the lab’s velocity due to the earth’s rotation is re- 
versed, and in 6 months, when the velocity around the sun is reversed. You 
cannot be at rest with respect to the ether at all these times, and you cannot 
get the same answer of c all the time. Yet that is exactly what happens when 
you perform the experiment! 

So people tried other solutions. Consider the speed of sound, 
760 miles per hour. Why doesn’t that change from day to day as the earth 
rotates around its own axis and around the sun? How can you and I talk
to each other, oblivious of these high speeds? The reason, of course, is that
the earth carries the air with it as it spins around itself and around the 
sun. So if you can carry the medium with you, then it doesn’t matter how 
fast you’re moving. When we fly, we carry the air with us in the plane, and
sound travels at the same speed inside the moving plane as in a static one. 
When I ask for a pillow, it takes the flight attendant the same duration of 
time to pretend he did not hear me, independent of the speed of the plane. 
So, people tried to argue that the earth carries the ether with it, the same 
way it carries the air with it. But it’s very easy to show, by looking at dis- 
tant stars, that this cannot be true. I don’t have room here to tell you more. 
(Read up on aberration of starlight if you want.) You cannot take the ether 
with you, and you cannot leave it behind; that was the impasse. 

The velocity of light posed quite a problem. Let us pause to absorb 
that. Imagine a bullet going to the right at a certain speed v. You move
to the right at some speed, say $\tfrac{1}{2}v$. I expect you to measure the speed of
the bullet to be $\tfrac{1}{2}v$. But if this were a light beam and not a bullet, you are
supposed to get the value v = c no matter what your speed was. You go at a
speed $\tfrac{3}{4}c$ along the beam, and you still measure the value of c. None of the
models of ether or materials could explain this. 

At this point Einstein entered the scene, and he explained the baffling 
behavior of light as follows. Recall that in Galilean relativity, an inertial ob- 
server who goes to sleep in a train parked at the station cannot tell, upon 
waking up, if the train has acquired a uniform velocity. But if the velocity 
of light depended on how fast the train was moving, then by comparing 
the velocity of light before and after the nap, the observer could deduce 
the speed of the train without looking outside. Because motion at uniform 
velocity produced detectable changes, without any reference to the out- 
side world, it was not relative but absolute. Even though mechanical laws 
remained the same in a moving train, laws of electricity and magnetism 
would betray its velocity. 

But, of course, this is not what happens, and the speed of light does 
not change with the speed of the train. This means electric and magnetic 
phenomena are part of the natural conspiracy to hide our uniform velocity. 
Just as mechanical phenomena won't tell us how fast we’re moving without 
looking outside, neither will electromagnetic phenomena. 

To Einstein it was obvious that nature would not design a system in 
which mechanical laws are the same in a moving frame, but laws of electromagnetism
are different. So he postulated that all phenomena, whatever
their nature, will be unaffected by going to a frame at constant velocity
relative to the initial one. That was a very bold postulate, because it
even applies to biological phenomena, which were not his specialty. But he 
believed that either all natural phenomena would follow the principle of 
relativity, or none would. 

Einstein had a faith that underlying laws of nature would have 
a certain uniformity across all natural phenomena. It’s not a religious 
issue—otherwise I wouldn't bring it up in the classroom—but it is cer- 
tainly the credo of all scientists, at least all physicists, that there is some 
elegance and consistency in the laws of nature. Even though they may 
not believe in design by any personal god, they do believe in this under- 
lying, rational system that we are trying to uncover. This belief has been 
reinforced over and over again. 


12.4 The postulates 


Here are the two great postulates of Einstein’s special relativity. 
Postulate I. All inertial observers are equivalent. 


Postulate II. The velocity of light is independent of the state of motion of 
the source and the observer. 


In postulate I, “equivalent” means each inertial observer is as privi- 
leged as any other to discover the laws of nature. If I find some laws, and 
you’re moving relative to me at uniform velocity, you'll find the same laws.
And if you and I find each other in relative motion, you have as much 
right to claim you are at rest and I am moving as I have to claim that 
I am at rest and you are moving. There is complete symmetry between 
observers in uniform relative motion. There is no symmetry between peo- 
ple in non-uniform motion. As I said, non-uniform motion creates effects 
that can destroy me and not destroy you. So, no one’s trying to talk his 
way out of acceleration, whereas you can talk your way out of uniform ve- 
locity. That’s the first postulate. This was understood even in the time of 
Newton. What is new now is that all inertial observers are equivalent with 
respect to all natural phenomena, including the electromagnetic, and not 
just mechanical phenomena. 

As per postulate II, if a light beam is emitted by a moving rocket, it 
doesn’t matter: it travels at c. If a light beam is seen by a moving rocket, 
it doesn’t matter: it will measure c. All people will get the same answer for 
the velocity of light. It is a postulate because it cannot be deduced from
anything else. 

It looks as if Einstein has solved the problem by saying that light be- 
haves this way because it is part of the big conspiracy to hide uniform 
motion. But you will see that he has made an expensive bargain trying 
to save relativity. His postulates will be seen to force him and us to give up 
many cherished notions of Newtonian physics. Think about why. Say a car 
is going 200 miles per hour according to me. You get into your own car 
and follow that car at 50 miles per hour. I expect you to measure its speed 
to be 150 miles per hour. But what if you also got 200 miles per hour? 
Now, this is not what happens for cars traveling at the speeds I mentioned, 
but that is exactly what happens if the car is replaced by a pulse of light. 
You must agree that is really incompatible with our daily notions and the 
formula w = v − u. When you put v = c, somehow w has got to come
out to be c and not c − u. Because that does not happen in the Galilean
transformation, it has to go. 


12.5 The Lorentz transformation 


We are led to seek a new rule or transformation connecting (x,t) and 
(x’, t’), such that when the velocity of light is computed, we get the same 
answer of c in both frames. Here is a clue to how things will work out. Sup- 
pose I send a pulse to the right at speed c, and you are going to the right 
at $\tfrac{3}{4}c$. My Newtonian expectation is that you should measure the speed of
the pulse as $\tfrac{1}{4}c$. But you insist it is c. What will I say to you? Because you are
finding the velocity of the pulse as the distance traveled divided by the time 
taken, and you are getting four times the answer I expected, I will say your 
“meter” sticks are somehow shorter than a meter. Specifically, I would say 
they have shrunk to one-fourth their original size (when we were at rel- 
ative rest), resulting in your measurement of a velocity four times what I 
expected. 

But there’s another possibility. Your clocks, which used to be iden- 
tical to mine when we were at relative rest, may be running slow. So you 
let the light travel for four seconds and thought it was only one second. 
That’s why you measured a velocity four times bigger than what I expect. 
Or maybe both your meter sticks and clocks are messed up. But something 
has to give, and so in searching for the replacement to the Galilean trans- 
formation of coordinates Eqns. 12.1 and 12.2, we will no longer assume 
length and time intervals are the same for both observers. 
Consequently, the spatial coordinate of the event I labeled (x, t) will
not be x’ = x − ut according to you, but


$$x' = \gamma(x - ut) \qquad (12.12)$$


where $\gamma$ is the fudge factor to convert lengths from me to you. Similarly,
I modify your expectation that x = x’ + ut’ (we admit the possibility that
t’ ≠ t) to read


$$x = \gamma(x' + ut') \qquad (12.13)$$


where the fudge factor $\gamma$ is the same from me to you and vice versa, because
otherwise we would not be equivalent, and one of us would be holding the
shortened “meter” stick. Since u → −u when we change from me to you,
we expect $\gamma$ to be a function of $u^2$. We proceed to nail down $\gamma$ as follows.

Suppose we sent off a light pulse when our origins coincided, and this 
pulse set off the firecracker at (x, t) according to me and (x’, t’) according 
to you. Because the light pulse took t seconds to travel x meters according 
to me and took t’ seconds to go x’ meters according to you, and we both
agree on the value of c, it must be true for this particular event that


$$x = ct \quad \text{and} \quad x' = ct'. \qquad (12.14)$$

Let us multiply the left-hand side of Eqn. 12.12 by the left-hand side 
of 12.13 and equate the result to the product of the right-hand sides to 
obtain 


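$$xx' = \gamma^2\left(xx' + xut' - x'ut - u^2tt'\right). \qquad (12.15)$$


Upon setting x = ct, x’ = ct’ we find


$$c^2tt' = \gamma^2\left(c^2tt' + uctt' - uct't - u^2tt'\right) \qquad (12.16)$$

$$1 = \gamma^2\left(1 - \frac{u^2}{c^2}\right) \qquad (12.17)$$

$$\gamma = \frac{1}{\sqrt{1 - u^2/c^2}}. \qquad (12.18)$$
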
Note that once we have found the length conversion factor $\gamma$ it does
not matter that we deduced it from this specific event involving a light
pulse. It can be applied to Eqns. 12.12 and 12.13 valid for a generic event.
Putting $\gamma$ back into Eqn. 12.12 we obtain


$$x' = \frac{x - ut}{\sqrt{1 - u^2/c^2}}. \qquad (12.19)$$


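We now go to $x = \gamma(x' + ut')$, Eqn. 12.13, isolate t’ and express it entirely
in terms of x and t as follows:


$$\begin{aligned}
t' &= \frac{1}{u}\left(\frac{x}{\gamma} - \gamma(x - ut)\right) \\
   &= \frac{\gamma}{u}\left(x\left(1 - \frac{u^2}{c^2}\right) - (x - ut)\right) \\
   &= \frac{t - \dfrac{ux}{c^2}}{\sqrt{1 - u^2/c^2}}. \qquad (12.20)
\end{aligned}$$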


To summarize, imagine I am (in frame) S and you are (in frame) S’, 
and you are moving to the right (increasing x direction) at speed u. Let 
my coordinates for an event be (x, t) and let your coordinates for the same 
event be (x’, t’). The Lorentz transformation tells us that 


$$x' = \frac{x - ut}{\sqrt{1 - u^2/c^2}} \qquad (12.21)$$

$$t' = \frac{t - \dfrac{u}{c^2}x}{\sqrt{1 - u^2/c^2}}. \qquad (12.22)$$


Observe that the Lorentz transformation reduces to the Galilean 
transformation if the velocity between you and me is much smaller than 
the velocity of light, that is, $u/c \ll 1$. So the relativistic formula really kicks
in only for velocities comparable to the velocity of light. 
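
Here is a small sketch that puts Eqns. 12.21 and 12.22 to work. It checks three things: an event on a light pulse according to me is also on the pulse according to you, the transformation is practically Galilean at everyday speeds, and transforming back with −u recovers the original coordinates. The function names, the rounded value of c, and the sample events are assumptions made for illustration.

```python
import math

C = 3.0e8  # speed of light in m/s, the rounded value quoted in the text

def gamma(u, c=C):
    """The factor 1 / sqrt(1 - u^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - u**2 / c**2)

def lorentz(x, t, u, c=C):
    """Map my (x, t) to your (x', t') using Eqns. 12.21 and 12.22."""
    g = gamma(u, c)
    return g * (x - u * t), g * (t - u * x / c**2)

def galilean(x, t, u):
    """Map my (x, t) to your (x', t') using Eqns. 12.1 and 12.2."""
    return x - u * t, t

# 1. An event on a light pulse (x = ct) stays on the pulse (x' = ct').
t = 2.0
xp, tp = lorentz(C * t, t, u=0.75 * C)
print("speed of the pulse according to you:", xp / tp)

# 2. At everyday speeds the two transformations are practically identical.
u_train = 30.0  # m/s, a hypothetical train
print(lorentz(1000.0, 10.0, u_train), galilean(1000.0, 10.0, u_train))

# 3. Transforming back with -u recovers my original coordinates.
x_back, t_back = lorentz(xp, tp, u=-0.75 * C)
print(x_back, t_back)
```

The third check reflects the symmetry between Eqns. 12.12 and 12.13: going from you to me uses the same fudge factor with u replaced by −u.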
You must clearly understand what the formula is connecting. Things
are happening in space and in time, right? Say something happens here. 
That something has a spatial coordinate and a time coordinate, according 
to two observers who originally had their origins and their clocks coin- 
cide when they passed. And one is moving to the right at speed u. Then 
the claim is that if the event had coordinates (x, t) for one person, for the 
other person moving to the right at speed u, the same event would have 
coordinates (x’, t’), related to (x, t) as above. 

Now we can see why Einstein gets the credit for making the world 
four-dimensional. After all, the four coordinates x, y, z, and t were present 
before he came along. But t’ was always equal to t in the old days, no mat- 
ter how you moved. In Einstein’s theory, x,y,z, and t get scrambled into 
primed variables. Here is an analogy. Imagine creatures restricted to the 
x-y plane. When they rotate axes, x and y get scrambled into each other.
Now, there is a third dimension perpendicular to these two, labeled by z,
but if all their rotations are limited to the x-y plane, z will never mix
with x and y, and z’ = z before and after rotation. This is what makes their
world two-dimensional. If you now permit these creatures to rotate out
of the x-y plane, then z will indeed begin to mix with x and y, by tiny
imperceptible amounts for small tilts (and they may not realize that z is
just another coordinate) and sizable ones for larger tilts where z gets
seriously mixed up with x and y. In our problem $u/c$ is the tilt in spacetime.
When our experiments were limited to $u/c \ll 1$, we thought t was an
invariant, the same for all observers. Einstein then showed us the complete
picture. 

You can already see that the theory will not admit velocities bigger 
than the velocity of light because the square root $\sqrt{1 - u^2/c^2}$ then becomes
imaginary. So the one single velocity that we wanted to be the same for 
everybody is also the greatest possible velocity, and no observer can move 
with respect to another at a speed that is equal to or in excess of the speed 
of light. 

The fact that an event has different coordinates in different sys- 
tems doesn’t mean the laws deduced are different. For example, suppose 
I’m on the ground and I throw a piece of chalk; it goes straight up and
comes straight down. If you see me from a moving train, you would think 
it went up and down but along a parabola. No one says the chalk will 
also go up and down for you, only that its motion will still obey New- 
ton’s laws. That’s all you really mean by saying things look the same in 
both inertial frames. Even though Einstein’s theory upheld the principle
of relativity, it discarded Newton’s laws. They had to be modified at
velocities comparable to c. However, relativity assures us that the same 
modified laws will be deduced by two observers in uniform relative 
motion. 


CHAPTER 13 


Special Relativity II:


Some Consequences 


13.1 Summary of the Lorentz transformation 


Let us begin with the Lorentz transformation, which relates the coordi- 
nates of an event in two different frames of reference, with the primed one 
moving at a velocity u relative to the unprimed one: 


$$x' = \frac{x - ut}{\sqrt{1 - u^2/c^2}} \tag{13.1}$$

$$t' = \frac{t - \dfrac{u}{c^2}x}{\sqrt{1 - u^2/c^2}}. \tag{13.2}$$


The Lorentz transformation is the cornerstone of relativity; all the 
funny stuff you hear about—the shrinking rods, the twin paradox— 
everything comes from these simple equations, derived without even 
calculus. If you consider the stresses and strains on a loaded steel beam, 
the mathematics involved is a whole lot more difficult. That is the remark- 
able thing about relativity. In extracting the bizarre consequences of these 
equations, Einstein showed as much courage as brilliance. If you derived 
these equations, you own them and have no choice but to deduce and 
defend the consequences. 

Now, some of you might be rattled by Eqns. 13.1 and 13.2; it’s not 
clear what is being stated. Let me explain. The velocity u is a fixed number. 
That’s your speed relative to mine. I see some event happening, and I give it
a pair of numbers, (x, t). You give the same event another pair of numbers, 
(x’, t’), and they are related by the Lorentz transformation. 

Here is the analogy. Consider the x — y plane with a point P as in 
Figure 13.1. I assign to it a pair of numbers, (x,y). Now, you have a different
coordinate system, rotated relative to mine by some angle $\theta$. Our
coordinates are related as follows: 


$$x' = x\cos\theta + y\sin\theta \tag{13.3}$$

$$y' = -x\sin\theta + y\cos\theta. \tag{13.4}$$


The angle $\theta$ is the analog of the velocity u. If $\theta = 0$, you and I agree
completely. In general, you plug my (x,y) into these equations and get
your (x’,y’). Let’s take $\theta = \pi/4$. In that case,

$$x' = \frac{x + y}{\sqrt{2}}, \qquad y' = \frac{-x + y}{\sqrt{2}}. \tag{13.5}$$

So, for every angle, the cosine and sine will reduce to some numbers; both
happen to be $1/\sqrt{2}$ here. For example, if (x,y) is (1,1), and your axis is
obtained from mine by a counterclockwise rotation of 45°, my (1, 1) lies
right on your x’ axis. So, I expect you to say y’ = 0, and this is indeed so.
And how about x’? It becomes $(1 + 1)/\sqrt{2} = \sqrt{2}$, the length of the position
vector, which is entirely along the x’ direction.


Figure 13.1 The same point P (solid dot) is assigned coordinates (x, y) by me 
and (x’,y’) by you. The pairs of coordinates are linearly related. The dotted lines 
indicate the sizes of the coordinates in the two frames. 
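
The 45° example is easy to confirm by machine. Here is a small sketch of Eqns. 13.3 and 13.4; the helper name `rotate_axes` is an assumption made for illustration.

```python
import math


def rotate_axes(x, y, theta):
    """Eqns. 13.3-13.4: coordinates of the same point in axes rotated by theta."""
    x_new = x * math.cos(theta) + y * math.sin(theta)
    y_new = -x * math.sin(theta) + y * math.cos(theta)
    return x_new, y_new


x_p, y_p = rotate_axes(1.0, 1.0, math.pi / 4)
print(x_p, y_p)  # x' is close to sqrt(2) and y' is close to 0
```

Up to rounding, the point (1, 1) lands on the x’ axis at a distance equal to the length of its position vector.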
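The Lorentz transformation is very similar. For example, the first
equation can be written as

$$x' = \left[\frac{1}{\sqrt{1 - u^2/c^2}}\right] x - \left[\frac{u}{\sqrt{1 - u^2/c^2}}\right] t \tag{13.6}$$

so that

$$\frac{1}{\sqrt{1 - u^2/c^2}} \quad \text{and} \quad \frac{u}{\sqrt{1 - u^2/c^2}} \tag{13.7}$$

are the analogs of $\cos\theta$ and $\sin\theta$. The same goes for the second equation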
involving t’. For a given u, these are just numbers depending on u, and 
(x’, t’) is a linear combination of (x, t). But I should warn you that the coef- 
ficients in square brackets are not cosine or sine of anything, because if they 
were, their squares should add up to 1, and they don’t. It is, however, still 
a linear homogeneous transformation. Linear means the new coordinates
(x’,t’) are related to the first powers of the old coordinates (x,t). They
don’t involve $t^2$ or $x^2$ and so forth. Homogeneous means that only the first
power is present, with no constant term added. This ensures that if
(x, t) = (0,0), then (x’, t’) = (0,0) as well.

Suppose you wish to invert these equations and write (x, t) in terms 
of (x’,t’). There are two options open for you. One is to say these are 
simultaneous equations. You have to find a way to solve for (x,t) in 
terms of (x’,t’) and all these funny coefficients involving u. You treat 
them all as constants and juggle them around, multiply by this, divide by 
that, and so on, to isolate x and t in terms of (x’,t’). But you shouldn’t 
do that because you know what the answer should be, namely, the old 
Lorentz transformation with the velocity reversed: 


$$x = \frac{x' + ut'}{\sqrt{1 - u^2/c^2}} \tag{13.8}$$

$$t = \frac{t' + \dfrac{u}{c^2}x'}{\sqrt{1 - u^2/c^2}}. \tag{13.9}$$

(For rotations, you can express (x,y) in terms of (x’, y’) by reversing
the sign of $\theta$ in Eqns. 13.3 and 13.4.)
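
If you would rather not take the velocity-reversal shortcut on faith, a quick numerical check helps. This is a sketch under stated assumptions: units are chosen so that c = 1, and the names `lorentz` and `round_trip` are made up for the illustration. It transforms an event with velocity u and then transforms the result with velocity −u.

```python
import math


def lorentz(x, t, u, c=1.0):
    """Lorentz transformation to a frame moving at speed u (units with c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (x - u * t), gamma * (t - u * x / c ** 2)


def round_trip(x, t, u, c=1.0):
    """Go to the primed frame with +u, then come back with -u."""
    x_p, t_p = lorentz(x, t, u, c)
    return lorentz(x_p, t_p, -u, c)


print(lorentz(3.0, 5.0, 0.6))     # your coordinates for my event (3, 5)
print(round_trip(3.0, 5.0, 0.6))  # back to (3, 5), up to rounding
```

Reversing the sign of u undoes the transformation, which is all that the simultaneous-equation route would have told you.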
A crucial step in getting some mileage out of the Lorentz transformation
is to take a pair of events. Let event 1 have coordinates $(x_1, t_1)$
according to me, $(x_1', t_1')$ according to you, and similarly for event 2. If
you write the Lorentz transformation for the two events and subtract the
primed coordinates of 1 from those of 2 you will find


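$$\Delta x' = \frac{\Delta x - u\,\Delta t}{\sqrt{1 - u^2/c^2}} \tag{13.10}$$

$$\Delta t' = \frac{\Delta t - \dfrac{u}{c^2}\Delta x}{\sqrt{1 - u^2/c^2}}, \tag{13.11}$$

where $\Delta x = x_2 - x_1$ et cetera. Note that the differences are not necessarily
small.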

The differences in coordinates also obey the same Lorentz 
transformation as the coordinates, thanks to the linearity of the 
Lorentz transformation. If you want the differences that I get in terms of 
yours, you just have to reverse the sign of u: 


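$$\Delta x = \frac{\Delta x' + u\,\Delta t'}{\sqrt{1 - u^2/c^2}} \tag{13.12}$$

$$\Delta t = \frac{\Delta t' + \dfrac{u}{c^2}\Delta x'}{\sqrt{1 - u^2/c^2}}. \tag{13.13}$$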


13.2 The velocity transformation law 


The two events chosen above were unrelated. Now consider them to be 
related as follows: 


• Event 1: I fire a gun.
• Event 2: The bullet hits the wall and gets embedded.


The bullet moves by an amount $\Delta x$ in time $\Delta t$ according to me, and
by $\Delta x'$ in time $\Delta t'$ according to you.

Let us agree on the following notation for velocities once and for all:

$$v = \frac{\Delta x}{\Delta t} = \text{velocity of any object according to me} \tag{13.14}$$

$$w = \frac{\Delta x'}{\Delta t'} = \text{velocity of any object according to you} \tag{13.15}$$

$$u = \text{your velocity relative to me}, \tag{13.16}$$


where all $\Delta$’s should of course be infinitesimals approaching 0 as we are
talking about instantaneous velocities. Suppose I say the bullet has velocity 
v. To find the velocity w according to you, we take the ratio of Eqns. 13.10 
and 13.11: 


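$$w = \frac{\Delta x'}{\Delta t'} = \frac{\Delta x - u\,\Delta t}{\Delta t - \dfrac{u}{c^2}\Delta x} = \frac{v - u}{1 - \dfrac{uv}{c^2}}, \tag{13.17}$$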


where, in the end, we have let all the differences go to 0. For small velocities
(dropping the terms that go as $1/c^2$) we get results agreeing with common
sense. Equation 13.17 is the relativistic velocity transformation law. 

Let us get used to going from your description to mine. Suppose you 
think a particle has velocity w. What will I think? Now we use Eqns. 13.12— 
13.13. Taking the ratios as before and recalling the definition of w we get 


$$v = \frac{w + u}{1 + \dfrac{uw}{c^2}}, \tag{13.18}$$


which amounts to changing the sign of u, and exchanging w and v in 
Eqn. 13.17. This velocity transformation law is what makes it possible to 
have an upper limit on velocity. If velocities added as in the old days, there 
could be no upper limit. If I tell you that according to relativity, nothing 
can go faster than the speed of light, you might try to beat the system as 
follows. You might ask if there can be a gun whose bullets go at three- 
fourths the velocity of light. I would say yes, and you would then ask, 
“How about a train that goes at $\tfrac{3}{4}c$?” I would say that seems to be allowed.
Then you can say, “Let me get into this train at $\tfrac{3}{4}c$ and fire a bullet at $\tfrac{3}{4}c$;
then, from the ground, it should appear to be going at 1.5c.” Well, that’s
the naive pre-relativistic expectation. But if you put $w = \tfrac{3}{4}c$ and $u = \tfrac{3}{4}c$
into the correct formula Eqn. 13.18, you get the correct answer to be

$$v = \frac{\tfrac{3}{4}c + \tfrac{3}{4}c}{1 + \tfrac{9}{16}} = \frac{24}{25}c. \tag{13.19}$$


If I apply this to a light pulse seen by you (w = c), the speed I will find is 


$$v = \frac{c + u}{1 + u/c} = c. \tag{13.20}$$
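
The train-and-bullet arithmetic, and the light-pulse case, can be reproduced in a few lines. This is only a sketch of Eqn. 13.18 in units where c = 1; the function name `add_velocities` is an assumption.

```python
def add_velocities(w, u, c=1.0):
    """Eqn. 13.18: speed I measure for an object moving at w in your frame."""
    return (w + u) / (1.0 + u * w / c ** 2)


print(add_velocities(0.75, 0.75))  # the bullet fired on the train: 24/25 of c
print(add_velocities(1.0, 0.75))   # a light pulse still moves at c for me
```

However close to c the two inputs are, the output stays below c, and an input equal to c returns exactly c.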


13.3 Relativity of simultaneity 


We used to think simultaneity was absolute. If two events occurring in 
Los Angeles and New York are simultaneous for me, fixed on the planet, 
then for you, flying in a rocket, they should be simultaneous. How can 
they not be? If two things are happening right now in different places for 
me, the same has to be true for you. But it’s not, as a consequence of the 
Lorentz transformation. 

Before I explain that, I must point out that there is one case when 
simultaneity is absolute. If the two events occurred at the same time and 
the same place, $\Delta x = 0$, $\Delta t = 0$, then the transformation will tell you $\Delta x' = 0$,
$\Delta t' = 0$. If I clapped my hands, my two hands were at the same place at
the same time. If someone says my hands were not at the same place at the 
same time, she is saying I didn’t clap. Relativity may change the coordinates 
of an event as we change observers, but it will not deny the very event. 

Now for the general case of the relativity of simultaneity. You, S’, 
are in the train, as shown in the upper half of Figure 13.2. To make two 
things happen simultaneously at the two ends of the train, you go to 
the middle of the train and send two flashes of light that go to the back 
(B) and front (F) of the train and set up two explosions. Because you're 
in the middle of the train, you know the explosions will be simultane- 
ous for you. Now, I see you from the ground. What do I see? The front 
F of the train is moving away from the light pulse, while the back B is 
rushing to meet the light pulse. Now, the velocity of light is the same for 
everybody and independent of the velocity of the source. If the back end 
is rushing to meet the light pulse and the front end is running away 
from the light pulse, I know the explosion at the back of the train occurs
first.

[Figure 13.2, two panels: “Train and light signals as seen by S’” (top) and “Train and light signals as seen by S” (bottom); B and F mark the back and front of the train.]

Figure 13.2 Top: The train as seen by you, observer S’, at rest on the train. You
emit a light pulse from the center of the train (big dot), which hits the front and
back ends F and B at the same time, setting off two simultaneous explosions
(small dots). Bottom: I, observer S, see you and the train moving to the right at
speed u. I see the back end of the train rushing to meet the light pulse and the
front end running away from it. The figure clearly shows the pulse reaching the
back end B, while the pulse headed for F, the front, has not yet arrived.

Now, why do we bring in the two counter-propagating light pulses
and not a pair of counter-propagating pigeons to trigger simultaneous 
events? Because the velocity of light is the same for everybody, indepen- 
dent of the motion of the source or the observer. That’s a postulate. That’s 
why many arguments in relativity involve doing things with light pulses or 
communicating with light pulses: we know the speed of light in any situa- 
tion. (We know a lot less about pigeons.) Therefore, we know you couldn’t 
have done any better in making them simultaneous, and I know for sure I 
will disagree with you. There’s no answer to the question, who is right? 
Here is an analogy. Look again at Figure 13.1. First, if two points have 
the same x and y coordinates in one set of axes, this will be true in any other 
set of axes because they are on top of each other. Next, if I pick two points 
that lie on a line parallel to the x-axis, then for me they have the same 
y. If you use rotated axes, you will say they have different y’-coordinates. 
There is nothing absolute about having the same y coordinate; it depends 
on the frame. In relativity, this happens to be true for time as well as space, 
and the two frames are related not by a relative rotation but by a relative
velocity. 
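
To put a number on the disagreement, here is a rough sketch based on Eqn. 13.11. The Los Angeles to New York separation of 4.0e6 m and the rocket speed of 0.6c are round figures assumed purely for illustration, and `delta_t_prime` is a made-up helper name.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def delta_t_prime(dx, dt, u, c=C):
    """Eqn. 13.11: time separation of two events in the frame moving at speed u."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (dt - u * dx / c ** 2)


dx = 4.0e6   # assumed separation of the two cities, in meters
dt = 0.0     # the two events are simultaneous for me
print(delta_t_prime(dx, dt, 0.6 * C))   # nonzero: not simultaneous for you
print(delta_t_prime(0.0, 0.0, 0.6 * C)) # same place and same time: zero for everyone
```

With these assumed numbers the events are about a hundredth of a second apart for the rocket, and the sign says the event farther along the direction of motion happens first.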


13.4 Time dilatation 


The next surprise has to do with clocks and time. You and I buy two iden- 
tical clocks. Then you get into your train and start moving relative to me. 
I will find that your clock is running slow, and you will say the same about 
mine. 

How do we turn this issue of the time period of the clock into a pair of 
events? I have my clock, and it goes tick, tick, tick. I pick two events. Event 1 
is a tick of the clock, and event 2 is the next tick. Let me put the clock at the 
origin of my coordinate system. (It doesn’t really matter where it is.) The
spacetime coordinate of the first tick is (x = 0, t = 0). What is the location 
of the clock during the second tick? Because I’m talking about a clock at 
rest in my frame, if the first event took place at x = 0, the second one also 
takes place at x = 0. The two successive ticks of the clock are separated 
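in space by 0 and in time by $\tau_0$, its time period. That means $\Delta x = 0$ and
$\Delta t = \tau_0$. According to you the time between two successive ticks is

$$\Delta t' = \frac{\tau_0 - \dfrac{u}{c^2}\cdot 0}{\sqrt{1 - u^2/c^2}} = \frac{\tau_0}{\sqrt{1 - u^2/c^2}}. \tag{13.21}$$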


Because the denominator is < 1, we find $\Delta t' > \tau_0$. For example, if
‘= 2, you will say my clock has a time period 30. In fact, anyone moving 
relative to me will say my clock is slow. 

Let us rederive this result by choosing the inverse Lorentz transfor- 
mation, in which case my coordinates are written in terms of yours. Now 


we find

$$\Delta t \,(= \tau_0) = \frac{\Delta t' + \dfrac{u}{c^2}\Delta x'}{\sqrt{1 - u^2/c^2}} \tag{13.22}$$

and bring in the other equation that states that the two events (successive
ticks of my clock) took place at the same point for me:

$$\Delta x = 0 = \frac{\Delta x' + u\,\Delta t'}{\sqrt{1 - u^2/c^2}}, \tag{13.23}$$

which means $\Delta x' = -u\,\Delta t'$. (This just means you see me and my clock
moving to the left at speed u.) Feeding this into Eqn. 13.22 we conclude
that

$$\tau_0 = \frac{\Delta t'\left(1 - \dfrac{u^2}{c^2}\right)}{\sqrt{1 - u^2/c^2}} = \Delta t'\sqrt{1 - u^2/c^2}, \tag{13.24}$$


which agrees with Eqn. 13.21. 
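
Both routes to the dilated period can be checked against each other numerically. The sketch below assumes units with c = 1 and an arbitrary speed u = 0.8; the helper names are illustrative. It computes your Δt’ from Eqn. 13.21, then feeds your (Δx’, Δt’) into the inverse transformation to confirm that I recover Δx = 0 and Δt = τ₀.

```python
import math


def dilated_period(tau0, u, c=1.0):
    """Eqn. 13.21: period of my clock as measured by you."""
    return tau0 / math.sqrt(1.0 - (u / c) ** 2)


def inverse_lorentz(dx_p, dt_p, u, c=1.0):
    """Eqns. 13.12-13.13: my differences in terms of yours."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (dx_p + u * dt_p), gamma * (dt_p + u * dx_p / c ** 2)


tau0, u = 1.0, 0.8               # assumed values for the illustration
dt_p = dilated_period(tau0, u)   # time between ticks according to you
dx_p = -u * dt_p                 # my clock drifts to the left in your frame
dx, dt = inverse_lorentz(dx_p, dt_p, u)
print(dt_p, dx, dt)              # dt_p exceeds tau0; dx is ~0 and dt is ~tau0
```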

Now, here is a paradox. Just as I will say your clock is slow, you'll say 
my clock is slow. How can we both accuse each other of having slow clocks? 
(A related question for the psych majors: how can two people simultane- 
ously look down on each other?) Here’s the answer that’s usually given. If I 
take a real clock like my watch and you ask why it looks slow to you when 
I’m moving relative to you, it’s difficult to answer because it has electron- 
ics and stuff, and I am still working on setting the clock on my microwave. 
For all of us who are technically challenged, there is a clock that’s partic- 
ularly simple. It has two mirrors a distance L apart in the y-direction and 
a light pulse that bounces up and down between the mirrors as shown in 
part A of Figure 13.3. Every time the pulse completes a round trip, it sets 
off some detector at the lower end, and the clock goes “tick.” The pulse 
travels a total distance 2L in the vertical direction between ticks, so that
$\tau_0 = 2L/c$ is the time period of my clock.

Figure 13.3 Part A shows my light clock in my frame. The light pulse goes
straight up and down a total distance 2L between ticks. Part B shows the same
clock as seen by you, to whom my clock seems to be moving to the left at speed u.
You say the clock has slowed down because the zigzag path is longer than the
straight up-and-down path, while the velocity of light is the same for both. A
clock at rest in your frame will be moving to the right according to me, and its
light pulse will be moving in a zigzag path and hence running slow according
to me.

Now you are moving relative to
me at velocity u, so according to you my clock is moving to the left at speed 
u, and it looks like part B in the figure. The light beam in my clock is going 
on a zigzag path along the hypotenuse. The time period t’ according to you
obeys the equation

$$ct' = 2\sqrt{L^2 + \left(\frac{ut'}{2}\right)^2}, \tag{13.25}$$

which can be solved to give

$$t' = \frac{2L/c}{\sqrt{1 - u^2/c^2}} = \frac{\tau_0}{\sqrt{1 - u^2/c^2}}. \tag{13.26}$$
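
For readers who want the intermediate algebra between Eqns. 13.25 and 13.26, here is one way to fill it in. It uses only the symbols already defined: L, u, c, t’, and τ₀ = 2L/c.

```latex
\begin{align*}
c^2 t'^2 &= 4L^2 + u^2 t'^2 && \text{(square both sides of Eqn. 13.25)} \\
t'^2\,(c^2 - u^2) &= 4L^2 \\
t' &= \frac{2L}{\sqrt{c^2 - u^2}}
    = \frac{2L/c}{\sqrt{1 - u^2/c^2}}
    = \frac{\tau_0}{\sqrt{1 - u^2/c^2}}.
\end{align*}
```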


In this argument I am invoking the fact that the vertical distance be- 
tween the mirrors in the y-direction is L for both observers. Here is one 
way to understand this. Suppose you and I paint an infinite line at height 
y = y’ = 1 meter when we are at relative rest. As we now move relative
to each other, our lines must match: there is no reason one should be be- 
low the other given the equivalence of inertial observers. The point is that 
the two lines, which extend forever, can be compared anywhere and at any 
time and do not run away from each other the way localized objects like 
clocks or rods do. 

Why do we like this clock? Because we know everything about the 
operation of the clock. We know that the zigzag path is longer than the 
straight up-and-down path, because the transverse coordinate is known to 
be the same for both. Because the velocity of light is the same in all frames 
of reference, light going on a longer path is simply going to take longer. 
So, I know your clock would slow down, and you can say my clock has 
slowed down by the same factor because the zigzag paths have the same 
lengths. 

What if you have some other clock with gears and wheels and teeth? 
How do I know that it too slows down when you go on your train? I don’t 
know exactly why it appears slowed to me, but I know that it too should 
appear slowed just like the light clock. To see this, consider a light clock 
L and an electronic clock E of the same time period that you take on the 
train. You of course will find them to be in sync because you are inertial 
and think you are at rest. Let the clocks be at the same location x’ = 0 with 
their ticks synchronized at t’ = 0. The next ticks of the two clocks have
the same spatial and time coordinates in your frame, and therefore in my
frame. The clocks will be in sync according to me as well.

Therefore, all clocks must do the same thing as the light clock, re- 
gardless of their mechanisms, and that includes biological clocks. Now you 
yourself are a biological clock: I look at you, say over a period of 40 or 60 
years, and I notice some changes. You become taller; your hair turns white; 
your teeth fall out. I don’t care how your life systems work, but you are a 
clock and you've got to slow down in my view if you move relative to me. 
That’s why physicists can make predictions about what happens to living 
systems even though that’s not our main business, which happens to be 
staring at our shoes at cocktail parties. 


13.4.1 Twin paradox 


Say my twin goes on a space mission at some high speed for 20 years. Let’s 
say he was 20 when he left, so that in the pre-relativity days, I would expect 
him to be 40 when he gets back. But he will come back younger, because 
as a clock, he has slowed down. What I think is 20 years could be 10 for 
him. So, he can come back 10 years younger than me. But what if he says 
the same thing and expects me to be younger? 

Who will be younger? There can only be one answer to that question, 
and if I can get it one way, it has to be right. 

Now, I have no reason to believe that I moved. I have been inertial 
the whole time, and I predict he will be younger. He cannot say the same, 
because he must have accelerated: he cannot go away and come back with- 
out first speeding up, then slowing down, reversing, speeding up toward 
home, and finally coming to rest on the earth. During all these periods of 
acceleration my twin has lost the right to argue he is equal to me. 

Coming to the counter-moving clocks: suppose they both read zero 
when they cross. Now we wait for some time during which they are mov- 
ing in opposite directions at uniform speed. A direct comparison will be 
impossible as long as they are both non-accelerated. To compare them, 
one or both of them have to change directions and of course velocities. 
If they did this symmetrically, they will agree at the second comparison, 
but if one was inertial the whole time and the other was not, the latter 
would be behind. If two of a set of triplets went away in opposite directions 
and returned on symmetric paths, they will both be younger than the one 
who stayed behind in the inertial frame. This is not science fiction at all.
If you want to be alive for the year 3000, you can do it. You do the math; 
you find the appropriate speed (nearly that of light) so you will be, say, 
just 2 years older; you get into the rocket and off you go to a distant star 
(nearly 500 light years away); and you come back. Fortunately, you can- 
not foist pictures of your trip on any of your friends because they will be 
long dead. 
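
“You do the math” can be taken literally. The sketch below is an idealization I am assuming for illustration: the traveler moves at one constant speed out and back, and the periods of acceleration are ignored. The function names are hypothetical. Given the time elapsed on earth and the time the traveler is willing to age, it returns the required speed as a fraction of c and the one-way distance in light years.

```python
import math


def required_beta(earth_years, traveler_years):
    """Speed (as a fraction of c) at which earth_years shrink to traveler_years."""
    return math.sqrt(1.0 - (traveler_years / earth_years) ** 2)


def one_way_distance_ly(earth_years, beta):
    """Distance to the turnaround point, in light years, as measured from earth."""
    return beta * earth_years / 2.0


beta_twin = required_beta(20.0, 10.0)    # the twin who ages 10 years in my 20
beta_far = required_beta(1000.0, 2.0)    # assumed: 1000 earth years, 2 on board
print(beta_twin, beta_far, one_way_distance_ly(1000.0, beta_far))
```

For the twin who ages 10 years during my 20, the speed is a little under 0.87c. For the 1000-year trip the speed is within a few parts per million of c, and the turnaround point is just short of 500 light years away.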

Now, this experiment is done all the time with subatomic particles. 
They are accelerated in Fermilab, for example. They go around in a ring, 
and, just by virtue of their motion, they live a very long time. Particles that 
are supposed to have a short lifetime (which you calculate in their own rest 
frame) live much longer because they are moving. 

Summary: Every clock runs the fastest in its own rest frame. (If it is 
a light clock, the light pulse follows the zigzag path in any other frame and 
appears to be slow.) 


13.4.2 Length contraction 


Length contraction refers to the fact that if you and I bought two identical 
meter sticks, and you set off in a plane or a rocket, I will claim your meter 
stick is actually shorter than a meter, and you will say the same about mine. 
In fact, way back, in the derivation of the Lorentz transformation, the 
fudge factor I calculated, taking your lengths into my lengths, is precisely 
connected with this factor. If you say the length of an object is something, 
I will tell you that the length is actually less because your meter sticks are 
short. And you will say the same about my length measurements. 

Let me prove that to you. Let’s take a rod that’s moving at a speed 
u. You’re carrying the rod, and I want to find its length. What should I
do? I’m going to introduce a pair of events to find the length of the rod. 
Remember this rod is zooming past me. Event 1 is when the front end 
of the rod hits a certain marking on my graduated x-axis, and event 2 is 
when the back end of the rod crosses another point. The distance between 
those two is the length of the rod, provided one condition is met: I must 
locate the two ends simultaneously. Otherwise, I will screw up, right? If I 
find the front end now, go on a lunch break and come back, and then locate
the other end, I will get a smaller result, even negative, because during this 
time period the rod has been sliding to the right. 

Consequently, the two events have coordinates with $\Delta x = L$, $\Delta t = 0$
in my unprimed frame. In your primed frame the two measurements took
place at the two ends of the static rod; hence they are separated by its rest
length $L_0$. Thus

$$L_0 = \frac{L - u\cdot 0}{\sqrt{1 - u^2/c^2}}. \tag{13.27}$$


If you cross-multiply, you get the relation between the rest length $L_0$
and the length in the frame in which the rod is moving at speed u:

$$L = L_0\sqrt{1 - u^2/c^2}. \tag{13.28}$$


We could equally well consider the inverse transformation, and pic- 
ture the rod to be at rest in the primed coordinates. In that case we 
find 


$$L = \frac{L_0 + u\,\Delta t'}{\sqrt{1 - u^2/c^2}} \tag{13.29}$$

$$0 = \frac{\Delta t' + \dfrac{u}{c^2}L_0}{\sqrt{1 - u^2/c^2}}, \tag{13.30}$$

where the second equation signifies that I measured the two ends simultaneously
and tells us $\Delta t' = -uL_0/c^2$. Feeding this into the first equation,
we find once again that $L = L_0\sqrt{1 - u^2/c^2}$.

So, a rod will appear longest in its rest frame and a clock will appear 
fastest in its rest frame. 
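
The two derivations of the contracted length can be compared numerically. This is a sketch in units with c = 1; the function names are assumptions, and the speed is chosen so that the square-root factor equals one half.

```python
import math


def contracted_length(rest_length, u, c=1.0):
    """Eqn. 13.28: length of a rod moving at speed u."""
    return rest_length * math.sqrt(1.0 - (u / c) ** 2)


def contracted_length_inverse_route(rest_length, u, c=1.0):
    """Same length from Eqns. 13.29-13.30, using the inverse transformation."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    dt_p = -u * rest_length / c ** 2         # from Eqn. 13.30 (dt = 0 for me)
    return gamma * (rest_length + u * dt_p)  # Eqn. 13.29


u_half = math.sqrt(3.0) / 2.0  # speed for which sqrt(1 - u^2/c^2) = 1/2
print(contracted_length(1.0, u_half))
print(contracted_length_inverse_route(1.0, u_half))
```

Both routes give half a meter for a meter stick at this speed, up to rounding.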

In what sense is this length contraction real? In what sense is a meter 
stick contracted to half a meter the same as a half-meter stick that is not 
moving? The answer is that they will both fit in a static case half a meter 
long: the static half-meter stick for all times, and the moving one for just 
one instant. 

I can say your meter stick is only half a meter long if you move fast 
enough. But you are inertial, and you can say my meter stick is half a meter 
long. How then do you explain my result that your stick is half as long as 
mine instead of the other way around? I will now discuss this paradox. 
13.5 More paradoxes
13.5.1 Too big to fall 


You and I bought identical rods one meter long, but you and your rod 
are now moving along the x-axis on top of my infinitely long table, at a 
velocity u such that $\gamma^{-1} = \sqrt{1 - u^2/c^2} = .5$, as shown in the top part of
Figure 13.4. The table has a hole .5 m long. I will find that there will be an 
instant when the entire rod fits within the hole. But you say, “My rod is a 
meter long, I am at rest, you and your table are the ones moving to the left 
at speed u. The hole that you think is .5 meters long actually is .25 meters 
long. So, there will never be a time when the two ends of my meter stick
lie entirely within that hole.” When this experiment is performed, will the
rod fit in the hole or not? That is the paradox. It is depicted and resolved
in Figure 13.4.

[Figure 13.4, panels from top to bottom: “Table rest frame times 1, 2”; “Rod rest frame −1”; “Rod rest frame 2f”; “Rod rest frame 2b”. L and R mark the ends of the rod; l and r mark the ends of the hole.]

Figure 13.4 At the top are two snapshots (1,2) in my (table) rest frame: the rod
has shrunk to .5 meters and is sliding over my infinite table, approaching a hole
.5 meters long. At my time 1, the rod is to the left of the hole; at my time 2 it fits
right over the hole with its right and left ends (R) and (L) aligned with the right
and left ends (r and l) of the hole. In the lower part are three views from the rod
(your) rest frame. At some earlier rod time, −1, the table is rushing to the left
with a hole .25 meters long. The right end (r) of the hole passes the right end of
the rod (R) at rod time 2f. At a later time 2b, the left end (l) of the hole passes the
left end (L) of the rod. Events 2f and 2b took place simultaneously at my time 2.
If we introduce a weak gravitational field near the hole (tiny vertical arrow), the
rod will fall through according to me, and it must do so according to you as well.
But how? The answer is provided in the main text.

Look at times labeled 1 and 2 in the top part of the figure. At my time 
1, the rod is approaching the hole. At my time 2, its right and left ends (R 
and L) line up with the right and left (r and l) ends of the hole. So the rod
does fit in the hole according to me. But you do not agree. Your description
is shown in the lower parts of the figure. At your (and the rod’s) time −1,
the table is rushing to the left with a hole .25 meters long. Next, the right
end (r) of the hole passes the right end of the rod (R) at your (rod) time
2f. At a later time 2b, the left end (l) of the hole passes the left end (L)
of the rod. The rod is a full meter long, and at no time does it lie entirely 
within the hole. The events that took place at your times 2f and 2b took 
place simultaneously for me at my time 2. 
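
The times 2f and 2b can be computed directly from the Lorentz transformation. The sketch below assumes that, in my table frame, the two left ends line up at (x, t) = (0, 0) and the two right ends line up at (0.5 m, 0); the helper name `lorentz` and the variable names are illustrative.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def lorentz(x, t, u, c=C):
    """Coordinates (x', t') of a table-frame event in the rod frame."""
    gamma = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return gamma * (x - u * t), gamma * (t - u * x / c ** 2)


beta = math.sqrt(3.0) / 2.0  # speed for which sqrt(1 - u^2/c^2) = 0.5
u = beta * C

left_ends = lorentz(0.0, 0.0, u)   # event 2b: left ends line up
right_ends = lorentz(0.5, 0.0, u)  # event 2f: right ends line up
print(left_ends, right_ends)
```

In the rod frame the right ends meet at x’ close to 1 m, the full rest length of the rod, and roughly 2.9 nanoseconds before the left ends meet. The two events that are simultaneous for me are ordered in time for you, which is why you never see the rod inside the hole.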

This disagreement can be made more dramatic if we add a tiny 
gravitational field near the hole, pointing along the small arrow in the up- 
permost figure. Now the shrunken rod will fall through the hole in my 
table frame. Then it must also fall through according to you, because there 
cannot be two answers to whether or not it did. Here is the explanation 
of how a rod of length 1.0 m can fall through a hole of width 0.25 m, fol- 
lowing W. Rindler, American Journal of Physics 29:365 (1961). It will help 
in this discussion to assume, without altering the paradox, that the hole is 
infinitesimally wider than the shrunken rod. 

First, imagine that a trapdoor under the table supports the rod until 
the left end of the rod crosses the left end of the hole. This will support 
the rod until it is fully over the hole. Let this crossing of the two left ends 
define the origin in spacetime: (x, t) = (x’, t’) = (0,0).

I will refer to your inertial frame as the rod frame, though you will 
keep going at velocity u, even if the rod begins falling. You share only the 
rod’s horizontal velocity after it begins to fall. 

At t = 0, the trapdoor drops down rapidly, allowing the rod to fall 
unhindered under gravity, with a tiny acceleration a. In the table frame, 
the entire rod begins to accelerate at time t = 0. We may safely employ 
pre-relativistic, Newtonian theory to describe the downward acceleration 
under the weak gravitational force. At time t > 0, a point on the rod located 
at x has the vertical coordinate z(x, t) (measured downward):

$$z(x, t) = \frac{1}{2} a t^2\, \Theta(t) \quad \text{where} \tag{13.31}$$

$$\Theta(t) = 0 \ \text{if}\ t < 0 \tag{13.32}$$

$$\phantom{\Theta(t)} = 1 \ \text{if}\ t > 0. \tag{13.33}$$


The function $\Theta(t)$ is just a compact way of saying that the downward acceleration
exists only for t > 0. That z(x, t) is independent of x reflects the
fact that all parts of the rod suffer the same acceleration under gravity and 
that the falling rod remains horizontal. 

All we need to do now is to transcribe this result, spacetime point by 
spacetime point, to your frame, to find out what you will see. Using 


$$t = \left(t' + \frac{ux'}{c^2}\right)\gamma \tag{13.34}$$


and the fact that the transverse coordinate of any event is the same for both 
frames, we find 


$$z'(x', t') = z(x, t) = \frac{1}{2} a \gamma^2 \left(t' + \frac{ux'}{c^2}\right)^2 \Theta\!\left(t' + \frac{ux'}{c^2}\right), \tag{13.35}$$


where (x’, t’) and (x, t) correspond to the same spacetime point. 

Though all points in the rod begin falling at the same time t = 0 in 
the table frame, these events are not simultaneous in the rod frame. While 
the left end begins falling at t = 0, the rest of the rod begins falling at an 
earlier (negative) time, which I will call the drop-time, $t'_D(x')$, that depends
on x’. Equation 13.34 tells us that t = 0 corresponds to

$$t'_D(x') = -\frac{ux'}{c^2}. \tag{13.36}$$


As expected, the left end x’ = 0 starts dropping at t’ = 0. As we move toward 
the right end, the drop-time turns more and more negative. The right end 
itself is seen to be the first one to start falling, at time

$$t'_D(x' = L_0) = -\frac{uL_0}{c^2}, \tag{13.37}$$

where $L_0$ is the rest length of the rod.


To avoid negative times, let us define a shifted time in your frame:

$$t'' = t' + \frac{u L_0}{c^2}. \tag{13.38}$$


Now the right end starts falling at your time t” = 0, the left end starts
falling at $t'' = uL_0/c^2$, and intermediate points begin falling at intermediate
times.

In your frame, the falling rod is not straight. In the time interval
$0 < t'' < uL_0/c^2$ the rod is horizontal with z’ = 0 from its left-most end up to
some drop-point $x'^{*}$, and it droops downward beyond. We determine the
drop-point $x'^{*}$ as a function of t” by turning to the same Eqn. 13.34, which
previously gave us the drop-time $t'^{*}(x')$ as a function of x’. We find


$$x'^{*}(t'') = -\frac{c^2 t'}{u} = -\frac{c^2}{u} \left[ t'' - \frac{u L_0}{c^2} \right] = L_0 - \frac{c^2 t''}{u}. \tag{13.39}$$


At t” = 0, the whole rod is straight, while at $t'' = uL_0/c^2$, even the left
end has begun falling.
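
If you want to watch the rod droop in your frame, the following minimal Python sketch evaluates the transcribed formula and the drop-point of Eqn. 13.39. The units (c = 1), the values of u, L0, and a, and the function names are illustrative assumptions of this sketch, not part of the text.

```python
import math

C = 1.0  # units in which c = 1 (an assumption made for convenience)


def gamma(u):
    """Lorentz factor for relative speed u."""
    return 1.0 / math.sqrt(1.0 - (u / C) ** 2)


def z_rod_frame(x_p, t_p, a, u):
    """Downward displacement z'(x', t') of the rod point x' at rod-frame time t'."""
    t = gamma(u) * (t_p + u * x_p / C**2)  # table-frame time of the same event, Eqn. 13.34
    return 0.5 * a * t * t if t > 0 else 0.0  # Theta(t) switches the fall on


def drop_point(t_pp, u, L0):
    """Drop-point x' at shifted time t'' (Eqn. 13.39), clipped to the rod 0 <= x' <= L0."""
    return min(max(L0 - C**2 * t_pp / u, 0.0), L0)


u, L0, a = 0.6, 1.0, 1e-3  # hypothetical speed, rest length, and acceleration
for t_pp in (0.0, 0.3, 0.6):
    t_p = t_pp - u * L0 / C**2  # undo the shift of Eqn. 13.38
    shape = [z_rod_frame(x, t_p, a, u) for x in (0.0, 0.5, 1.0)]
    print(f"t''={t_pp:.1f}  drop-point x'={drop_point(t_pp, u, L0):.2f}  z'={shape}")
```

With these numbers the drop-point sweeps from the right end to the left end at speed c²/u ≈ 1.67c, and points to the right of it have already dropped below z’ = 0.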

In your frame, the right edge of the hole moves leftward at speed u,
and it is always at z’ = 0. Since it was to the right of the right end of the rod
to begin with, it will never overtake the drop-point, which moves leftward
at a speed $c^2/u$. Since $c^2/u > c > u$, by the time the right edge of the hole crosses
any point on the rod, that point will have already fallen below z’ = 0. (That
$c^2/u > c$ does not imply interactions or signals propagating at superluminal
velocities. The points that make up the rod fall independently of each other
and do not interact with each other in our treatment.)

If this discussion leads you to conclude that the notion of a rigid body 
does not exist in relativity, you are correct. 

In summary, you and I can accuse each other of using shortened rods, 
and we can both be right. This is because the operational way to find the 
length of a moving body involves measuring the two ends simultaneously, 
and simultaneity is relative: you will say you measured the two ends of my 
rod at the same time and I will disagree, and vice versa.


13.5.2 Muons in flight


Muons are particles produced in the upper atmosphere and detected on
earth. After an (average) lifetime of 2.2 μs in their rest frame, they decay
into an electron and a neutrino-antineutrino pair. Their lifetime multiplied
by c equals roughly 660 meters, far short of the distance between
their birth in the upper atmosphere and their detection on earth. How
then do they make it to the ground? The answer depends on the point of
view. Earthlings will say the muon lifetime of 2.2 μs in its rest frame is prolonged
by relativistic time dilatation. In other words, the birth and death
of the muon will coincide with successive ticks of a clock of period 2.2 μs
in the muon rest frame, and this clock will appear slowed to us on earth.
The muon, on the other hand, will agree that it lives only 2.2 μs, but it will
claim that the distance between the upper atmosphere and the ground is a
lot less than we claim it is, due to length contraction. The dying muon is
like a clock that has slowed down according to us, and the atmosphere is
like a rod that has Lorentz contracted according to the muon.
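
To put numbers on the two points of view, here is a small Python sketch. The lifetime of 2.2 μs is from the text; the muon speed of 0.999c and the production altitude of 10 km are assumed values chosen only for illustration, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s
TAU = 2.2e-6       # muon lifetime in its rest frame, in seconds (from the text)


def gamma(v):
    """Lorentz factor for speed v."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def earth_view(v, height):
    """Earth frame: (dilated lifetime, time needed to cover the full height)."""
    return gamma(v) * TAU, height / v


def muon_view(v, height):
    """Muon frame: (rest lifetime, time for the contracted atmosphere to rush past)."""
    return TAU, (height / gamma(v)) / v


v = 0.999 * C   # assumed muon speed
h = 10_000.0    # assumed production altitude in meters

print(f"c * tau = {C * TAU:.0f} m")
for name, (life, trip) in (("earth", earth_view(v, h)), ("muon", muon_view(v, h))):
    print(f"{name:5s}: lifetime {life * 1e6:6.2f} us, trip {trip * 1e6:6.2f} us, ratio {life / trip:.3f}")
```

The two observers disagree on the individual times, but the ratio of lifetime to trip time is the same in both frames, so they agree on whether a typical muon reaches the ground.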


CHAPTER 14


Special Relativity III: Past, Present, and Future


In this chapter we continue to explore the consequences of the Lorentz 
transformation. 


14.1 Past, present, and future in relativity 


Let’s take the equation for the time difference between two events numbered
1 and 2:


$$\Delta t' = \frac{\Delta t - \dfrac{u\,\Delta x}{c^2}}{\sqrt{1 - u^2/c^2}}. \tag{14.1}$$


First, something happens; then, something else happens, and Δt = t₂ − t₁
is the separation in time between them. Let’s say Δt > 0 so that the second
event occurred after the first event, according to me, using unprimed coordinates.
How about according to you? Well, Δt’ doesn’t have to have the
same sign as Δt because you subtract from it this number, $u\,\Delta x/c^2$, which can
be arbitrarily large and positive. Therefore, you can find that Δt’ could
be negative. Surely you understand that’s a big deal. I say this happened
first and that happened later. Then, you say, no, it happened the opposite
way. Now, this can lead to serious logical contradictions, especially if
event 1 is the cause of event 2. Here is a standard example both in special
and general relativity that people talk about: event 1 is the birth of some
kid’s grandmother, and event 2 is the kid’s birth. The birth of the kid takes
place long after the birth of the grandmother, according to me. What if,
from some other point of view, the kid is born, but the grandmother is not
yet born, and something is done to prevent her from being born? Where
did the grandchild come from? Or consider this: event 1, I fired a bullet.
Event 2, somebody is hit. And you go to the frame of reference in which
the person has been hit, and I haven’t fired the bullet. Now, you come and
you finish me off. So now, we have a person wounded for no apparent reason
because the cause (my firing) has been eliminated. That simply cannot
happen. Einstein recognized that if A can be the cause of B, we had better not
find an observer for whom these events occur in reverse order, because if
the cause occurs after the effect, then there is some time left for somebody
to prevent the cause itself from happening, and we will have an effect with
no perceivable cause. So, we want to make sure that the sign of Δt’ cannot
be reversed whenever the first event is or could have been the cause of
the second event. So, you ask Eqn. 14.1: if Δt is positive, when will Δt’ be
negative? That is simple algebra; you want the second term in the numerator
to beat the first term. That will happen in any frame with velocity u
obeying


$$\frac{u\,\Delta x}{c^2} > \Delta t \tag{14.2}$$

$$\frac{u}{c} > \frac{c\,\Delta t}{\Delta x}. \tag{14.3}$$


Now compare the two distances:


• cΔt, the distance a light pulse can travel in the time Δt between the two
events.
• Δx, the actual spatial separation between the events.


If cΔt > Δx, there is enough time for a light signal to go from event 1
to event 2. In this case the frame we are looking for is going faster than
light: u/c > 1. Such a frame of course does not exist. So the order of events is
the same for all possible observers. But this is the case when event 1 could
have been the cause of event 2 because a light signal could have been used
to cause event 2. By the same logic if cΔt < Δx, there isn’t enough time
for a light signal to go from event 1 to event 2. In this case the frame we
are looking for moves at u/c < 1. Such a frame of course exists. In such a frame
the order of events is reversed, with 2 occurring before 1. But this is the case when
event 1 better not have been the cause of event 2. This is assured if the
maximum speed with which one can influence other events is the speed
of light. Because this pair of events is separated by a time that is too short
even for a light signal to go from 1 to 2 in time, they could not have been
causally connected.

In other words, we are going to say that if there isn’t enough time for 
a light signal to go from event 1 to event 2, then event 1 could not have 
been the cause of event 2. Therefore, for the theory to make logical sense, 
no signal should travel faster than the speed of light. 
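
The algebra behind Eqns. 14.1 to 14.3 is easy to try out numerically. The sketch below is a minimal illustration in units where c = 1; the function names and the sample separations are assumptions made for this example only.

```python
import math


def delta_t_prime(dt, dx, beta, c=1.0):
    """Time separation seen from a frame moving at u = beta * c (Eqn. 14.1)."""
    return (dt - beta * dx / c) / math.sqrt(1.0 - beta**2)


def reversal_threshold(dt, dx, c=1.0):
    """Smallest u/c that flips the order (Eqn. 14.3), or None if it would need u >= c.

    Assumes dt > 0 and dx > 0.
    """
    beta_min = c * dt / dx
    return beta_min if beta_min < 1.0 else None


# c*dt = 3 < dx = 5: light cannot connect the events, and frames with u/c > 0.6 reverse them.
print(reversal_threshold(3.0, 5.0), delta_t_prime(3.0, 5.0, 0.8))

# c*dt = 5 > dx = 3: light can connect the events, and no frame with u < c reverses them.
betas = [b / 100 for b in range(-99, 100)]
print(reversal_threshold(5.0, 3.0), min(delta_t_prime(5.0, 3.0, b) for b in betas))
```

In the first case the transformed time difference is negative once u/c exceeds 0.6; in the second it stays positive for every sampled frame.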

When we heard rumors in 2011 about neutrinos traveling faster than 
light, most of us did not believe them. This is not because we believe in Ein- 
stein’s infallibility (even he did not), but because we believe in the causal 
structure in our world. If it is absent, if events are not related by cause and 
effect, there is no point looking for laws correlating them. 

Our current view of spacetime is shown in Figure 14.1. One axis is of 
course the x-axis. Along the other we measure ct rather than t so that both 
coordinates have the same dimension, a natural requirement if we want to 
treat space and time as symmetrically as possible. (Any velocity besides c 
would do the trick but would be unnatural, given the unique role played 
by c in the theory. It would also lead to inelegant formulas, polluted by the 
presence of this arbitrarily chosen velocity.) 

Event 1 shows me at the origin of spatial coordinates when my clock 
says zero; I’m at (0,0) in spacetime. In the Newtonian world, any point 
with t > 0, say event 2 or 4, is said to be in my absolute future, and any 
event with t < 0, say event 3 or 5, is called my absolute past, and all points 
on the x-axis with t = 0 are called “present.” These labels are absolute be- 
cause all observers will agree on the order (including simultaneity) of these 
events. Events in my future can be affected by me. For example, if I decide 
at event 1 that I want an explosion to occur at 4 or 2, I can make that hap- 
pen. Events 3 and 5 are in my past—someone at 5, for example, can decide 
to harm me at 1 and do it. Thus, in the Newtonian world, we have three re- 
gions: future, past, and present, all labels being absolute, valid in any other 
frame. 

After Einstein, we draw new lines at x = ±ct, which describe light
signals traveling past 1 in both directions. Thus, for example, the line x = ct
describes a light signal that originated to my left, reached me at t = 0, and
kept going to the right for t > 0.


Figure 14.1 Relative to some event at the origin (0,0), Newtonian spacetime is 
divided into the absolute future (points with ct > 0), absolute past (ct < 0), and 
absolute present ct = 0, where “absolute” means for all observers. Relativistic 
spacetime, depicted in the figure, is very different. I am at the origin (0,0), 
named event 1. Event 2 lies to my absolute future, meaning that I can influence it 
and it will occur later according to all observers. Event 3 lies in my absolute past; 
it can affect me here and now at 1 and all observers will say it occurred earlier. 
Event 4 is in my future but not my absolute future, meaning I can find someone 
for whom it occurs before 1. However, no logical paradoxes will arise because I 
cannot affect 4 by using any signal that travels at or less than the speed of light. 
Likewise, event 5 is in my past but not absolute past; I can find someone who says 
it happened after 1. 


Consider now event 2. Because ct > x, there is enough time for a light 
signal to go from 1 to 2. Not only is event 2 in the future of 1 according to 
me, it will be that way according to anybody else. In other words, Δt is positive
for me, but, if you go back to Eqn. 14.1, because ct > x, you will never
find anybody who says 2 occurred earlier than 1. Another way to say it is
that sitting at 1, I can make event 2, say an explosion, happen by sending a
signal that travels slower than light. Therefore, 1 could have caused 2. And
therefore, there is no messing around with the order of these events. Note
that all we ask is that 1 could have been the cause of 2 and not that it actually
was. If it could have been the cause, the theory automatically makes 1
occur before 2 for all observers. By this logic, all events lying above the 45°
lines x = ±ct are called my absolute future. By “absolute,” I mean lying in
the future of 1 not only according to me but according to all observers. It 
may not be later by the same number of seconds, but it will be later. Events 
in the absolute future lie in the “forward light cone.” I use the word cone 
because in higher spatial dimensions these points will lie inside a cone. 

Similar ideas apply to points in the backward light cone, labeled ab- 
solute past in the figure. Thus event 3 could have been the cause of what's 
happening to me right now at 1, because from that event, a signal could 
have been sent to arrive where I am right now at a speed less than c. 

Now consider events outside the light cone, called “elsewhere,” such 
as 4. Suppose at event 1, I open an envelope and it says something terri- 
ble is going to happen at 4. Say the distance to 4 is two seconds times the 
velocity of light, and it is to occur one second later. There is nothing I can 
do, even though it has not happened yet (according to me). In the New- 
tonian days, there was something I could have done: tell someone else to 
really hurry up and get there and do something. But now I cannot do that 
because that would require that person to travel faster than light, and that’s 
not allowed; it is outside the light cone. So, even though you know some- 
one is planning to do something evil there, you cannot avert it. That’s a 
very important thing for people who are going to law school. You know we 
got the DNA defense from biology, right? “My client’s DNA doesn’t match; 
you must acquit.” Here’s another defense from physics. If your client was 
accused of doing something at 4, and he was last seen at 1, you can argue 
that “My client was outside the light cone.” The “outside the light cone 
defense” is absolutely watertight. If an event is outside your client’s light 
cone, the client cannot be held responsible. Your client would have had 
to send a signal faster than light, and every jury knows that is impossible. 
As for an event like 4, the status is that I can actually find other observers 
moving at a speed less than light for whom 4 occurs before 1. So, the order 
of these events can be reversed. But it will not lead to logical contradictions 
because we know the two events could not have been causally connected. 
Similarly, 3 is to my absolute past, so it could be responsible for all the trouble
I am having right now, but 5 cannot be blamed for what is happening
to me at 1.

So, spacetime, which we used to divide into the upper and lower half
planes, the future and past, separated by a line called present or
now, is now divided into three regions: the absolute future that you can
affect, the absolute past that can affect you, and “elsewhere,” which contains
events that cannot affect you and cannot be affected by you (sitting
at (0,0)).
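
As a small illustration of this three-way division, here is a Python sketch that sorts an event into its region relative to an observer at the origin. The function name is hypothetical, and the only event taken from the text is event 4 (two light-seconds away, one second later); the other sample points are assumed.

```python
def region(ct, x):
    """Classify the event (ct, x) relative to an observer sitting at (0, 0)."""
    if ct > abs(x):
        return "absolute future"    # inside the forward light cone
    if ct < -abs(x):
        return "absolute past"      # inside the backward light cone
    if abs(ct) == abs(x):
        return "on the light cone"  # reachable only by a light signal
    return "elsewhere"              # outside the light cone


# Event 4 of the text, in units where c = 1: x = 2 light-seconds, t = 1 second.
print(region(1.0, 2.0))   # elsewhere
# Two assumed events, one in each light cone.
print(region(2.0, 1.0))   # absolute future
print(region(-2.0, 1.0))  # absolute past
```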

The consequences of the Lorentz transformation, whose equations 
are deceptively simple, a lot simpler than some equations with angular mo- 
mentum, are really stupendous. If you invent a theory like this, you have to 
make sure that there are no contradictions. When you notice that the order 
of events can be reversed, you might panic. But the theory is so beautiful, 
it is so internally consistent; it says you can reverse the order of events only 
if they could not have been causally connected. Within the theory, “not 
being causally connected” means a signal would have to travel faster than 
light to go from the first event to the second. 


14.2 Geometry of spacetime 


Now we turn to something that is mathematically very pretty and physi- 
cally rather profound. 

The coordinates of a point are not sacred. They are dependent on 
who is looking at them and from what orientation. Recall the relation 
between new coordinates (x’, y’) and the old (x, y) when we rotate the axes: 


$$x' = x\cos\theta + y\sin\theta \tag{14.4}$$

$$y' = -x\sin\theta + y\cos\theta. \tag{14.5}$$


But even in this world, something is sacred: $x'^2 + y'^2 = x^2 + y^2$. Namely,
the distance from the origin (or more generally, the distance between two
points) is unaffected by rotations. This distance is called an invariant. The
same goes for the dot product between two vectors, because it involves
their lengths and the angle between them, none of which is affected by the
rotation of axes.

So, it is reasonable to ask, in the relativistic case, where people cannot
agree on the time coordinate or space coordinate, whether the square
of the time coordinate plus the square of the space coordinate will be the
same for two people. We find out that that’s not the case; $x^2 + t^2$ is not
invariant. But even before you do that, you should shudder at the prospect
of writing something like this. You cannot add $t^2$ and $x^2$, because they have
different dimensions. We have to have both coordinates in spacetime with
units of length or time. The standard trick is to introduce an object with
two components with the same dimensions. I’m going to call the object X.
The first component of X is going to be called $x_0$, which is just ct. The second
is going to be called $x_1$, which is just our familiar x. In four dimensions
you have $X = (x_0, x_1, x_2, x_3)$, which is sometimes written as


$$X = (x_0, \mathbf{r}) = (ct, x, y, z) \tag{14.6}$$


where r is the usual three-dimensional position vector. Why do I switch
from ct and x to $x_0$ and $x_1$? If you’re doing superstrings, you need ten
coordinates: one will be $x_0$, the time, and the other nine will be spatial coordinates,
$x_1 \ldots x_9$. Numerical indices rather than letters of the alphabet
are preferred, because we may run out of letters but will never run out of
numbers. It is also more natural to sum over a numerical index than letters.

What does the Lorentz transformation look like when I write it in
terms of the components of X? I leave it to you to check that in terms of
the dimensionless velocity β


$$x_0' = \frac{x_0 - \beta x_1}{\sqrt{1 - \beta^2}} \tag{14.7}$$

$$x_1' = \frac{x_1 - \beta x_0}{\sqrt{1 - \beta^2}} \tag{14.8}$$

$$\text{where} \quad \beta = \frac{u}{c}. \tag{14.9}$$


Now, you see the relationship is nice and symmetric. If you write
it in terms of x and t, the coordinate transformation law is asymmetric
because one has units of length and one has units of time. As for the other
transverse coordinates in the y and z directions


$$x_2' = x_2, \qquad x_3' = x_3. \tag{14.10}$$

In other words, the length perpendicular to the motion is something you
can always agree on.

Let’s return to the question of whether $(x_0')^2 + (x_1')^2 = x_0^2 + x_1^2$ as in
rotations. It is not. However, the following is true:


$$(x_0')^2 - (x_1')^2 = x_0^2 - x_1^2 \equiv s^2 \tag{14.11}$$


where $s^2$ is called the spacetime interval.

Let us verify this fundamental result starting with Eqns. 14.7
and 14.8:


\begin{align}
(x_0')^2 - (x_1')^2 &= \frac{(x_0 - \beta x_1)^2 - (x_1 - \beta x_0)^2}{1 - \beta^2} \notag \\
&= \frac{(x_0^2 - x_1^2)(1 - \beta^2)}{1 - \beta^2} \notag \\
&= x_0^2 - x_1^2. \tag{14.12}
\end{align}


In terms of more familiar quantities this becomes

$$(ct')^2 - x'^2 = (ct)^2 - x^2. \tag{14.13}$$


If we consider the difference between two events (not necessarily
infinitesimal), they also obey


$$(\Delta x_0')^2 - (\Delta x_1')^2 = \Delta x_0^2 - \Delta x_1^2 \equiv \Delta s^2 \tag{14.14}$$


because coordinate differences transform just like the coordinates. 

This result says that even though people cannot agree on the time
or space coordinates of an event or time or space coordinate differences
between two events, they can agree on this quadratic function of the coordinates
or differences, which we called $s^2$. It is an example of an invariant, a
name given to something unchanged under a given transformation (which
here is the Lorentz transformation). Note that despite the notation, $s^2$ is
not always positive. You can verify that $s^2$ is positive inside the light cones
when it is described as time-like; $s^2$ is negative outside the light cone when
it is referred to as space-like; and it vanishes on the light cone when it is
alluded to as light-like.
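
A quick numerical check makes the contrast with rotations concrete. In the minimal Python sketch below (the function names and the sample event are assumptions for illustration), the sum of squares changes under a Lorentz transformation while the difference of squares does not.

```python
import math


def boost(x0, x1, beta):
    """Lorentz transformation of (x0, x1) = (ct, x), as in Eqns. 14.7 and 14.8."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (x0 - beta * x1), g * (x1 - beta * x0)


def interval(x0, x1):
    """Spacetime interval s^2 = x0^2 - x1^2 (Eqn. 14.11)."""
    return x0**2 - x1**2


x0, x1 = 3.0, 1.0  # an arbitrary sample event
for beta in (0.0, 0.5, 0.8, -0.9):
    y0, y1 = boost(x0, x1, beta)
    print(f"beta={beta:+.1f}  sum of squares={y0**2 + y1**2:8.3f}  s^2={interval(y0, y1):.6f}")
```

Every row reports the same value of s², up to rounding, while the Euclidean-looking sum of squares varies from frame to frame.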

While in ordinary rotations you take the sum of the squares, here you
must take the difference of the squares. And that’s just the way it is. Even
though time is like another coordinate that mixes with space, it’s not quite
the same. You can go forward or backward in the spatial direction, but
not in the time direction. The three-dimensional space in which we live is
Euclidean, and the invariant distance is given by the sum of the squares of
all the coordinates. Spacetime is not Euclidean.

Including the transverse coordinates (perpendicular to the velocity
u), $s^2$ becomes


$$s^2 = (x_0')^2 - (x_1')^2 - (x_2')^2 - (x_3')^2 = x_0^2 - x_1^2 - x_2^2 - x_3^2. \tag{14.15}$$


Most of the time, I’m not going to worry about transverse coordinates.


14.3 Rapidity 


Just as trigonometric functions naturally enter the formulas relating old
and new coordinates under rotations in the x-y plane, hyperbolic functions
are tailor-made for the Lorentz transformation. If you already know
these functions you will appreciate their utility, and if not, you can use this
opportunity to learn about them here, in which case I urge you to fill in the
missing proofs of various identities. You can also skip this section without
loss of continuity.

Let us begin with some properties of these functions and then turn
to their application. The hyperbolic functions sinh θ (pronounced “cinch”
θ) and cosh θ are the analogs of sin θ and cos θ, and defined as follows in
terms of $e^{\pm\theta}$:


$$\cosh\theta = \frac{e^{\theta} + e^{-\theta}}{2} \tag{14.16}$$

$$\sinh\theta = \frac{e^{\theta} - e^{-\theta}}{2}. \tag{14.17}$$


A key identity follows from this definition by straightforward computation:


$$\cosh^2\theta - \sinh^2\theta = 1. \tag{14.18}$$


Note the minus sign compared to $\cos^2\theta + \sin^2\theta = 1$. The addition
formulas that also follow from the definitions are


$$\cosh(A + B) = \cosh A \cosh B + \sinh A \sinh B \tag{14.19}$$

$$\sinh(A + B) = \sinh A \cosh B + \cosh A \sinh B. \tag{14.20}$$

The hyperbolic analog of tan θ, written tanh θ and pronounced “tanch” θ, is:


$$\tanh\theta = \frac{\sinh\theta}{\cosh\theta} = \frac{e^{\theta} - e^{-\theta}}{e^{\theta} + e^{-\theta}}. \tag{14.21}$$

It vanishes as θ → 0 and tends to ±1 as θ → ±∞.
The addition formula

$$\tanh(A + B) = \frac{\sinh(A + B)}{\cosh(A + B)} = \frac{\tanh A + \tanh B}{1 + \tanh A \tanh B} \tag{14.22}$$


follows from the addition formulas Eqns. 14.19 and 14.20 for sinh and cosh.

Now for the application to relativity, starting with the velocity addition
formula. We have seen that if an object has velocity w according to
you and you are moving with a velocity u with respect to me, its velocity
with respect to me is not v = w + u as in non-relativistic physics but


$$\frac{v}{c} = \frac{\dfrac{w}{c} + \dfrac{u}{c}}{1 + \dfrac{wu}{c^2}} \tag{14.23}$$


where the denominator $1 + wu/c^2$ plays a crucial role in keeping velocities
from exceeding c.

Suppose we express the dimensionless velocities u/c, v/c, and w/c in terms
of the rapidities $\theta_u$, $\theta_v$, and $\theta_w$ defined as follows:


$$\frac{u}{c} = \tanh\theta_u \tag{14.24}$$

$$\frac{v}{c} = \tanh\theta_v \tag{14.25}$$

$$\frac{w}{c} = \tanh\theta_w. \tag{14.26}$$


Observe that both sides lie in the range [−1, +1]. You can invert these
formulas to find the θ corresponding to some dimensionless velocity. For
example, in the case of β = u/c:


$$\theta_u = \frac{1}{2} \ln \frac{1 + \beta}{1 - \beta}. \tag{14.27}$$

The relativistic velocity addition formula now takes the form


$$\tanh\theta_v = \frac{\tanh\theta_w + \tanh\theta_u}{1 + \tanh\theta_w \tanh\theta_u} = \tanh\left[\theta_w + \theta_u\right] \tag{14.28}$$


by virtue of Eqn. 14.22. This leads to the remarkably simple result

$$\theta_v = \theta_w + \theta_u. \tag{14.29}$$


In other words, rapidities simply add when compounded, unlike velocities.
Thus if you see an object moving at rapidity $\theta_w$, and you are moving
with respect to me with a rapidity $\theta_u$, I will ascribe to the object a rapidity
$\theta_v = \theta_w + \theta_u$. While this means rapidities can grow without limit when
we jump from frame to frame, the property |tanh θ| < 1 ensures that the
corresponding velocity never exceeds c.
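
If you would rather see this on a computer than on paper, the following minimal Python sketch compares the velocity addition formula of Eqn. 14.23 with the addition of rapidities. It uses units where c = 1; the function names and the sample velocities are assumptions chosen for illustration.

```python
import math


def add_velocities(w, u, c=1.0):
    """Relativistic velocity addition, Eqn. 14.23."""
    return (w + u) / (1.0 + w * u / c**2)


def rapidity(v, c=1.0):
    """Rapidity theta defined by v/c = tanh(theta)."""
    return math.atanh(v / c)


w, u = 0.8, 0.9  # assumed sample velocities in units of c
v = add_velocities(w, u)
theta_v = rapidity(w) + rapidity(u)  # rapidities simply add

print(f"v from Eqn. 14.23       : {v:.6f}")
print(f"tanh(theta_w + theta_u) : {math.tanh(theta_v):.6f}")
print(f"rapidity via logarithm  : {0.5 * math.log((1 + u) / (1 - u)):.6f} vs {rapidity(u):.6f}")
```

The first two printed numbers agree, and both stay below 1 even though w + u = 1.7.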

Now consider the Lorentz transformation. Given that


\begin{align}
\frac{u}{c} &= \tanh\theta_u \tag{14.30} \\
&= \frac{\sinh\theta_u}{\cosh\theta_u} \tag{14.31} \\
&= \frac{\sqrt{\cosh^2\theta_u - 1}}{\cosh\theta_u}, \tag{14.32}
\end{align}


we can readily invert the above to obtain the following expressions for
$\cosh\theta_u$ and $\sinh\theta_u$:


$$\cosh\theta_u = \frac{1}{\sqrt{1 - u^2/c^2}} \tag{14.33}$$

$$\sinh\theta_u = \frac{u/c}{\sqrt{1 - u^2/c^2}} \tag{14.34}$$


which satisfy $\cosh^2\theta_u - \sinh^2\theta_u = 1$.

We may rewrite the Lorentz transformation, Eqns. 14.7 and 14.8, in
terms of $\theta_u$ as


$$x_0' = x_0 \cosh\theta_u - x_1 \sinh\theta_u \tag{14.35}$$

$$x_1' = -x_0 \sinh\theta_u + x_1 \cosh\theta_u \tag{14.36}$$


which are analogous to the formulas for rotations. The invariance of the
spacetime interval


$$(x_0')^2 - (x_1')^2 = x_0^2 - x_1^2 = s^2 \tag{14.37}$$


follows from $\cosh^2\theta - \sinh^2\theta = 1$, just as $x'^2 + y'^2 = x^2 + y^2$ follows from
$\cos^2\theta + \sin^2\theta = 1$.

Suppose a Lorentz transformation parametrized by $\theta_1$ relates X’ to
X and then a second one involving $\theta_2$ relates X” to X’. If we now directly
express X” in terms of X by eliminating X’, we will find that it is given
by a Lorentz transformation with rapidity $\theta_1 + \theta_2$. (This will require using
the addition formulas Eqns. 14.19 and 14.20.) Again this just means that
rapidities add under a sequence of two Lorentz transformations just like
rotation angles add under a sequence of two rotations.
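
The elimination of X’ can also be carried out numerically. The sketch below writes Eqns. 14.35 and 14.36 as a 2 × 2 matrix and multiplies two of them; the helper names and the sample rapidities are assumptions for illustration, and plain Python lists are used instead of a matrix library.

```python
import math


def boost_matrix(theta):
    """Matrix form of Eqns. 14.35 and 14.36 acting on the column (x0, x1)."""
    ch, sh = math.cosh(theta), math.sinh(theta)
    return [[ch, -sh], [-sh, ch]]


def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


theta1, theta2 = 0.4, 0.7  # assumed sample rapidities
two_steps = matmul(boost_matrix(theta2), boost_matrix(theta1))  # X -> X' -> X''
one_step = boost_matrix(theta1 + theta2)                        # X -> X'' directly

for row_a, row_b in zip(two_steps, one_step):
    print([round(x, 9) for x in row_a], [round(x, 9) for x in row_b])
```

The two matrices agree entry by entry, which is the numerical counterpart of the addition formulas for cosh and sinh.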

Even a cursory reading of this section should impress upon you how 
marvelously the inventions of mathematicians fulfill and often anticipate 
the needs of the physicist. 


14.4 Four-vectors 


Let me introduce some notation now. Our friend $X = (x_0, x_1, x_2, x_3) = (x_0, \mathbf{r})$
is the first example of a four-vector. Later we will see other four-vectors
with four components each. If $A = (a_0, a_1, a_2, a_3)$ is a four-vector,
its components will transform like those of X:


$$a_0' = \frac{a_0 - \beta a_1}{\sqrt{1 - \beta^2}} \tag{14.38}$$

$$a_1' = \frac{a_1 - \beta a_0}{\sqrt{1 - \beta^2}} \tag{14.39}$$

$$a_2' = a_2 \tag{14.40}$$

$$a_3' = a_3 \tag{14.41}$$


if the relative velocity is in the $x_1$ direction.


This transformation law is what defines a four-vector, more pre- 
cisely a four-vector under Lorentz transformations, just as r transforms 
like a three-vector under ordinary rotations. Just as we took the three 
components of space and added one more of time to form the posi- 
tion four-vector X, every four-vector we encounter will be the union of 
a three-vector like r and a fourth component that, like time, is unaffected 
by ordinary rotations but gets mixed up with the others under Lorentz 
transformations. 

Now we’re going to define a dot product of two four-vectors A and B:


$$A \cdot B = a_0 b_0 - a_1 b_1. \tag{14.42}$$

I leave it to you to show that in a different frame

$$A' \cdot B' = a_0' b_0' - a_1' b_1' = a_0 b_0 - a_1 b_1 = A \cdot B \tag{14.43}$$


for the same reason that X · X was invariant under the Lorentz transformation.
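
For readers who want to check their work, here is one way the algebra can go. It is a sketch that assumes only the transformation law of Eqns. 14.38 and 14.39 applied to both A and B; the cross terms proportional to β cancel in pairs.

```latex
\begin{align*}
a_0' b_0' - a_1' b_1'
 &= \frac{(a_0 - \beta a_1)(b_0 - \beta b_1) - (a_1 - \beta a_0)(b_1 - \beta b_0)}{1 - \beta^2} \\
 &= \frac{a_0 b_0 - \beta a_0 b_1 - \beta a_1 b_0 + \beta^2 a_1 b_1
          - a_1 b_1 + \beta a_1 b_0 + \beta a_0 b_1 - \beta^2 a_0 b_0}{1 - \beta^2} \\
 &= \frac{(a_0 b_0 - a_1 b_1)(1 - \beta^2)}{1 - \beta^2} \\
 &= a_0 b_0 - a_1 b_1 .
\end{align*}
```

Setting B = A = X reproduces the invariance of the spacetime interval.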


14.5 Proper time 


Now I am going to apply the notion of the spacetime interval to the study
of a single particle. Previously, dx and dt were separations between two
random, unrelated or arbitrary events. But now I want you to consider the
following events. A particle moves in spacetime from point 1 to point 2,
traveling a distance dx in time dt according to some generic observer. So,
these are two events in the life of a particle, lying on its trajectory. Let’s look
at the infinitesimal spacetime interval $(ds)^2 = c^2 dt^2 - dx^2$ between the two
events. We’re going to rewrite this as follows:


$$(ds)^2 = c^2 dt^2 - dx^2 = (c\,dt)^2 \left( 1 - \frac{1}{c^2} \left( \frac{dx}{dt} \right)^2 \right) \tag{14.44}$$

$$ds = c\,dt \sqrt{1 - \frac{v^2}{c^2}} \tag{14.45}$$


where dx/dt = v is the velocity of the particle. Because ds is an invariant, the
same for all observers, let us calculate it according to the particle itself.
The particle does not think it is moving and so it sets v = 0. So the two
events, particle sighted here and particle sighted there, have different x
coordinates for a generic observer, but they are the same point according
to the particle. However, the particle does think some time dτ has elapsed.
Therefore, in terms of the dτ of the particle


$$ds = c\,d\tau. \tag{14.46}$$


The time τ, measured by a clock moving with the particle, is called proper
time. In other words, if the particle had its own clock, dτ is the time it
would say elapsed between the two successive points in its trajectory. It’s
not hard to understand why everybody agrees on proper time. You and I
don’t have to agree on how much time elapsed between when the particle
was here and when it was there. But if we ask how much time elapsed
according to the particle, we are all asking the same question, and we all get
the same answer, dτ.

Remember for future use that the proper time difference dτ between
two events in the trajectory of a particle is related to the time difference dt
according to an observer who ascribes to the particle a velocity v as follows:


$$d\tau = dt \sqrt{1 - \frac{v^2}{c^2}}, \quad \text{which means} \tag{14.47}$$

$$\frac{dt}{d\tau} = \frac{1}{\sqrt{1 - v^2/c^2}}. \tag{14.48}$$
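
Here is a minimal Python sketch of the statement that everybody agrees on proper time. The separations, the relative velocities, and the function names are assumed for illustration, in units where c = 1.

```python
import math


def proper_time(dt, dx, c=1.0):
    """Proper time between two events on a trajectory: c^2 dtau^2 = c^2 dt^2 - dx^2."""
    return math.sqrt((c * dt) ** 2 - dx**2) / c


def boost(dt, dx, beta, c=1.0):
    """Coordinate differences (dt', dx') seen from a frame moving at u = beta * c."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (dt - beta * dx / c), g * (dx - beta * c * dt)


dt, dx = 1.0, 0.6  # a particle with v = 0.6 c, according to me
print("my frame       :", proper_time(dt, dx), dt * math.sqrt(1 - 0.6**2))
print("another frame  :", proper_time(*boost(dt, dx, -0.5)))
print("particle frame :", boost(dt, dx, 0.6))  # dx' = 0 and dt' = dtau
```

All three lines report a proper time of 0.8 (up to rounding), and in the particle's own frame the two events occur at the same place.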


CHAPTER 15 


Four-momentum 


In Newtonian mechanics particles have coordinates, let us say (x, y), which
could vary with time. From these we form a two-dimensional vector r =
ix + jy. In a rotated frame the components (x’, y’) are given by


$$x' = x\cos\theta + y\sin\theta \tag{15.1}$$

$$y' = -x\sin\theta + y\cos\theta. \tag{15.2}$$


An entity V is defined as a vector (in two dimensions) if it has two
components $(V_x, V_y)$ that, under rotation of the axes, go into $V_x'$ and $V_y'$
related to $(V_x, V_y)$ exactly as in Eqns. 15.1 and 15.2.

Now, I say to you, “Okay, that’s one vector, the position vector r. Can
you point to another vector?” You might suggest the velocity v = dr/dt as
an answer. That is right, but why does taking the derivative of a vector
produce another vector? Well, what’s the derivative? You change the vector
by some Δr, and you divide it by the time difference Δt. Now, the
change in the vector is obviously a vector, because the difference of two
vectors is a vector. Dividing by time is like multiplying by the reciprocal
of the time. That’s like multiplying by a number or a scalar, because time
does not respond to rotations. And I’ve told you multiplying a vector by a
number also gives you a vector, maybe longer or shorter. Therefore, Δr/Δt
is a vector because it’s the difference of r later minus r now, and dividing
by a tiny Δt is the same as multiplying by a large number (10,000 or
1,000,000, it doesn’t matter). And the limit v = dr/dt is also a vector. Therefore,
when you take a derivative of a vector with respect to a parameter
like time, which does not respond to rotations, you get a vector. Once you
get this derivative, it becomes addictive. You can take second derivatives,
and you get the acceleration a. Then you can multiply that by mass or inertia,
which is postulated to be a scalar or invariant under rotations. Thus
the product ma is a vector, which Newton equates to the force F, also a
vector.

What I want to do is generate more four-vectors using this idea of
starting with the position four-vector X. Now, take this X to be the coordinate
in spacetime of an object that’s moving. I want to take the derivative
of that to get myself something I could call the velocity vector in relativity.
But the derivative cannot be the time derivative. I can of course take
the time derivative, but the time derivative of a vector in four dimensions
is not a vector, because time is like any other component. It’s like taking
the y derivative of x for a moving particle to get the velocity. That doesn’t
give you a vector. You have to take a derivative with respect to something
that does not transform under the Lorentz transformation, something that
does not change from one observer to the other. Do you have any idea
where I’m going with this? I mean, you can take the derivative with respect
to τ, the time as measured by the particle. So, I’m going to form a new
quantity V called four-velocity,


$$V = \frac{dX}{d\tau} = \left( \frac{dx_0}{d\tau}, \frac{dx_1}{d\tau} \right). \tag{15.3}$$


By construction, V will be a four-vector. By that I mean, its four components
will transform when you go to a moving frame just like the four
components of X. But we do not have a good intuition for the τ-derivative.
While dx and dt are the separations measured by a generic observer, dτ
is the time elapsed according to the particle. So we rewrite Eqn. 15.3 for
V in terms of quantities that are directly measured by the generic observer.
To this end, we replace τ-derivatives by t-derivatives by invoking
Eqn. 14.48:


\begin{align}
V = \frac{dX}{d\tau} &= \frac{dX}{dt} \frac{dt}{d\tau} \tag{15.4} \\
&= \frac{1}{\sqrt{1 - v^2/c^2}} \frac{dX}{dt} \tag{15.5} \\
&= \frac{1}{\sqrt{1 - v^2/c^2}} \left( \frac{c\,dt}{dt}, \frac{dx}{dt} \right) \tag{15.6} \\
&= \left( \frac{c}{\sqrt{1 - v^2/c^2}}, \frac{1}{\sqrt{1 - v^2/c^2}} \frac{dx}{dt} \right) \tag{15.7} \\
&= \left( \frac{c}{\sqrt{1 - v^2/c^2}}, \frac{\mathbf{v}}{\sqrt{1 - v^2/c^2}} \right) \tag{15.8}
\end{align}


where the last equation applies when all four dimensions are exhibited.
The four-velocity has an unusual feature: its “length squared”


$$V \cdot V = V_0^2 - V_1^2 = c^2 \tag{15.9}$$


does not depend on how fast the particle is moving! You can verify this the
hard way, by computing $V_0^2 - V_1^2$ starting with Eqn. 15.7, or the easy way
by evaluating the invariant in a frame in which the particle is at rest and
V = (c, 0).
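
The "hard way" is painless on a computer. The minimal Python sketch below (function names and sample speeds are assumptions, with c = 1) builds the two-component four-velocity of Eqn. 15.7 and evaluates its length squared for several speeds.

```python
import math


def four_velocity(v, c=1.0):
    """Two-component four-velocity (V0, V1) of Eqn. 15.7 for a particle of speed v."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * c, g * v


def minkowski_dot(a, b):
    """Dot product of two-component four-vectors, A . B = a0 b0 - a1 b1 (Eqn. 14.42)."""
    return a[0] * b[0] - a[1] * b[1]


for v in (0.0, 0.5, 0.9, 0.999):  # assumed sample speeds in units of c
    V = four_velocity(v)
    print(f"v={v:5.3f}  V=({V[0]:8.4f}, {V[1]:8.4f})  V.V={minkowski_dot(V, V):.9f}")
```

The components of V grow without limit as v approaches c, yet V · V stays equal to c² = 1 in these units.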


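Now we are ready to define the four-momentum P as the mass m
times the four-velocity V:

\begin{align}
P &= m\,\frac{dX}{d\tau} \tag{15.10}\\
&= \left(\frac{mc}{\sqrt{1 - v^2/c^2}}, \frac{mv}{\sqrt{1 - v^2/c^2}}\right) \equiv (p_0, p_1) \quad \text{in two dimensions} \tag{15.11}\\
&= \left(\frac{mc}{\sqrt{1 - v^2/c^2}}, \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) \equiv (p_0, p_1, p_2, p_3) \quad \text{in four dimensions.} \tag{15.12}
\end{align}
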
For this to be a four-vector, m should be the same in all frames, that
is, invariant under Lorentz transformations. 

So, we have manufactured a new beast with four components. What 
is it? Let me keep just the component of velocity in the x direction, but still 
call it a four-vector. 

Consider first

$$
p_1 = \frac{mv}{\sqrt{1 - v^2/c^2}}. \tag{15.13}
$$

For v/c << 1, we find (upon setting the denominator to unity)

$$
p_1 = mv. \tag{15.14}
$$


Thus we conclude that p₁ stands for the momentum of the particle in the
relativistic theory. However, if the particle picks up speed, we need to take
into account the denominator. As v → c, p₁ grows without limit: that is, in
this theory, there is a limit to the particle velocity but not its momentum.
Some people like to write

$$
p_1 = \left(\frac{m}{\sqrt{1 - v^2/c^2}}\right) v \equiv m(v)\,v \tag{15.15}
$$

where m(v) = m/√(1 − v²/c²) is a new velocity-dependent mass. They also
refer to m(0) = m as the rest mass m₀. Their point is that if you introduce a
velocity-dependent mass, then momentum can still be mass times velocity
as in the old days. We will not do that: for us m is always the rest mass, and
momentum is now a more complicated function of this mass and velocity.

Suppose v/c is small but not utterly negligible. Then we can use a
slightly better formula for momentum by using

$$
(1 + x)^n = 1 + nx + \dots
$$

for x << 1, to write

$$
\frac{1}{\sqrt{1 - v^2/c^2}} = \left(1 - \frac{v^2}{c^2}\right)^{-1/2} = 1 + \frac{v^2}{2c^2} + \dots \tag{15.16}
$$

and

$$
p_1 = mv\left(1 + \frac{v^2}{2c^2} + \dots\right) \tag{15.17}
$$

where the ellipses (dots) stand for even smaller corrections we are ignoring.
We can calculate more such correction terms or simply use the exact
expression with the √(1 − v²/c²) in it.
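
To get a feeling for how much the first correction buys you, here is a small Python sketch. It is my own illustration, in units where c = 1 by default; the function names are made up for this example. It compares the Newtonian momentum, the momentum with the first correction from the expansion in Eqn. 15.16, and the exact expression of Eqn. 15.13.

```python
import math


def p_exact(m, v, c=1.0):
    """Exact relativistic momentum, Eqn. 15.13."""
    return m * v / math.sqrt(1.0 - (v / c) ** 2)


def p_newton(m, v):
    """Newtonian momentum: the denominator is set to unity."""
    return m * v


def p_first_correction(m, v, c=1.0):
    """Momentum keeping the first correction 1 + v^2/(2 c^2) from Eqn. 15.16."""
    return m * v * (1.0 + v**2 / (2.0 * c**2))


for v in (0.01, 0.1, 0.3, 0.6):
    exact = p_exact(1.0, v)
    print(v,
          abs(p_newton(1.0, v) - exact) / exact,            # relative error, Newton
          abs(p_first_correction(1.0, v) - exact) / exact)  # relative error, one correction
```

For small v/c the corrected formula is far closer to the exact answer than mv alone; as v/c grows, both approximations degrade and you should go back to the exact expression.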
It is also clear that if we bring in p₂ and p₃ we just get the vector

$$
\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}. \tag{15.18}
$$

What does the 0-th component

$$
p_0 = \frac{mc}{\sqrt{1 - v^2/c^2}} \tag{15.19}
$$



stand for? If we set v = 0 we get the mass of the particle times c, which is a 
constant. Let us go to the next level of approximation and write as before, 


$$
p_0 = mc + \frac{1}{2}\,\frac{mv^2}{c} + \dots \tag{15.20}
$$

We see that if we multiply both sides by c, finally something familiar
emerges:

$$
c\,p_0 = mc^2 + \frac{1}{2}mv^2 + \dots \tag{15.21}
$$


We see that the second term on the right is just the non-relativistic
kinetic energy. So it must be that all higher powers of v/c shown by the ellipsis
stand for corrections to kinetic energy as we consider faster particles.
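
Here is a short Python sketch of that statement. It is an illustration I am adding, with made-up function names and units in which c = 1 unless you pass another value. It subtracts the rest energy from c·p₀ and compares what is left with ½mv².

```python
import math


def kinetic_energy_exact(m, v, c=1.0):
    """c*p0 minus the rest energy m c^2, using Eqn. 15.19 for p0."""
    gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return (gamma - 1.0) * m * c**2


def kinetic_energy_newton(m, v):
    """The familiar (1/2) m v^2."""
    return 0.5 * m * v**2


for v in (0.01, 0.1, 0.5, 0.9):
    ratio = kinetic_energy_exact(1.0, v) / kinetic_energy_newton(1.0, v)
    print(v, ratio)  # close to 1 for slow particles, larger for fast ones
```

The ratio drifts away from 1 as v/c grows; that drift is the sum of the higher-order corrections hidden in the ellipsis.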

But it must then be that the first term mc² also stands for energy, but
of a particle at rest. This is called its rest energy. Einstein did not tell us how
to extract this energy (in contrast to kinetic energy of motion, which can be
extracted, say, in hydroelectric power generators using turbines that slow
down the water). Later on, when people discovered fusion or fission, they
found that some amount of mass was missing at the end of the reaction
and that this missing mass (upon multiplying by c²) exactly equaled the
additional kinetic energy of the final particles. For this reason P is called
the energy-momentum or momentum four-vector.

To summarize, we have seen two four-vectors: 


$$
X = (x_0, x_1) = (ct, x) \quad \text{the position four-vector} \tag{15.22}
$$

$$
P = (p_0, p_1) = \left(\frac{E}{c}, p\right) \quad \text{the energy-momentum or momentum four-vector} \tag{15.23}
$$


Here are some consequences of P being a four-vector. 


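• The components of P transform as follows when we go from one frame to
another:

\begin{align}
p_0' &= \frac{p_0 - \beta p_1}{\sqrt{1 - \beta^2}} \tag{15.24}\\
p_1' &= \frac{p_1 - \beta p_0}{\sqrt{1 - \beta^2}} \tag{15.25}\\
p_2' &= p_2 \tag{15.26}\\
p_3' &= p_3 \tag{15.27}
\end{align}

where β = u/c. We will ignore p₂ and p₃ from now on.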

It will be instructive for you to verify that this transformation law
is in accord with another way of finding p₀′ and p₁′. To this end, consider
a particle moving at speed v as seen by the unprimed observer. Write
explicit formulas for p₀ and p₁ in terms of m and v and keep them handy.
Now ask what velocity w this particle will have according to the observer
moving at velocity u. Write down her expressions for p₀′ and p₁′ in terms
of w, and now write w in terms of u and v. Check that the result agrees
with what is above. To save paper you may set c = 1 in this exercise.

• P_A · P_B is invariant, where P_A and P_B are any two four-momenta, which
could refer to two different particles A and B or the same particle, in which
case A = B.

First consider just one particle and the value of P · P. Because it
can be found in any frame, find it in its own co-moving frame. In this
case p₁ = 0 and p₀ = mc. Thus

$$
P^2 = P \cdot P = p_0^2 - p_1^2 = m^2c^2. \tag{15.28}
$$

You may verify that if you went to a generic frame and wrote down
expressions for p₀ and p₁ in terms of v, you would get the same result.


If there are two particles we may assert that

$$
P_A \cdot P_B = p_{A0}\,p_{B0} - p_{A1}\,p_{B1} = \frac{E_A E_B}{c^2} - p_{A1}\,p_{B1} \tag{15.29}
$$

will have the same value for all observers. In the rest frame of particle B,
P_A · P_B = E_A m_B, where by E_A I mean the energy of A as measured in the
rest frame of B.

A photon has no mass. This means

$$
K \cdot K = 0 \tag{15.30}
$$

where K is a common name for the four-momentum of any photon.
The photon has no rest frame; in any frame it moves at c. However, its
components (k₀, k₁) will undergo Lorentz transformation as we change
frames of reference. The components of K again stand for E/c and
momentum of the photon. Zero mass means that

$$
0 = K \cdot K = k_0^2 - k_1^2 \quad\Longrightarrow\quad k_1 = \pm k_0. \tag{15.31}
$$

In the above equation, the spatial momentum k₁ can be positive or
negative, but k₀, which corresponds to energy, is always positive.


The energy and momentum of photons are usually denoted by ω
and k:

$$
K = (k_0, k_1) = \left(\frac{\omega}{c}, k\right). \tag{15.32}
$$

The zero-mass condition becomes

$$
k = \pm\frac{\omega}{c}. \tag{15.33}
$$

C 
An equivalent way to show the zero-mass condition explicitly is
the following:

\begin{align}
K &= (k, k) && \text{right moving photon} \tag{15.34}\\
K &= (k, -k) && \text{left moving photon} \tag{15.35}
\end{align}

where it is understood that k > 0. Thus a single number, the photon
momentum ±k, fixes the K in one spatial dimension.
You could equally well write K in terms of ω instead of k:

\begin{align}
K &= \left(\frac{\omega}{c}, \frac{\omega}{c}\right) && \text{right moving photon} \tag{15.36}\\
K &= \left(\frac{\omega}{c}, -\frac{\omega}{c}\right) && \text{left moving photon.} \tag{15.37}
\end{align}


• Four-momentum, if conserved in one frame, is conserved in any frame
related by a Lorentz transformation.

The virtue of momentum as we derived it is that if it is conserved
in one frame, it will be conserved in any other. For example, if particles
A and B turn into C, D, E, and in one frame we have

$$
P_{\text{initial}} = P_A + P_B = P_C + P_D + P_E = P_{\text{final}} \tag{15.38}
$$

then in any other frame we will have

$$
P'_{\text{initial}} = P'_A + P'_B = P'_C + P'_D + P'_E = P'_{\text{final}} \tag{15.39}
$$

because if two vectors P_initial and P_final are equal in one frame they are
equal in any other. This must be clear from analogy with usual vectors:
if A + B = C, that is, the three form a triangle in one frame, then they
will also form a triangle in a rotated frame. Or one can say that because
A + B − C = 0, the null vector, the left-hand side will be the null (zero)
vector in any frame because the rotated version of the null vector is the
null vector. In the case of the Lorentz transformation, if P_final − P_initial =
0, the difference vector will vanish in any frame if it vanishes in one.

It also follows that if you did not like my definition of four-momentum
and made up your own, yours may not have the property
of being conserved in all frames if conserved in one. Conservation in all
frames is what makes momentum an important quantity, and for that
it has to be a four-vector.
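
The verification suggested in the first item above can also be done numerically. The following Python sketch is my own illustration, with c = 1 and function names invented for the purpose. It assumes the standard one-dimensional velocity addition rule w = (v − u)/(1 − uv) for the velocity seen by the observer moving at u. It builds the four-momentum in the moving frame two ways, once by transforming (p₀, p₁) and once directly from w, and also checks that P · P does not change.

```python
import math


def four_momentum(m, v):
    """(p0, p1) of a particle of mass m and velocity v, in units with c = 1."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return (m * gamma, m * gamma * v)


def boost(P, u):
    """Components seen by an observer moving at velocity u (beta = u, c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - u * u)
    p0, p1 = P
    return (gamma * (p0 - u * p1), gamma * (p1 - u * p0))


def relative_velocity(v, u):
    """Velocity w of the particle according to the observer moving at u."""
    return (v - u) / (1.0 - u * v)


def dot(A, B):
    """Four-vector dot product a0*b0 - a1*b1."""
    return A[0] * B[0] - A[1] * B[1]


m, v, u = 1.0, 0.6, 0.3
transformed = boost(four_momentum(m, v), u)
direct = four_momentum(m, relative_velocity(v, u))
print(transformed, direct)           # the two pairs should agree
print(dot(transformed, transformed)) # should equal m^2 (c = 1)
```

Because the transformation is linear in the components, a sum of four-momenta that balances in one frame also balances after `boost` is applied to every term, which is the content of the last item.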


15.1 Relativistic scattering 


Let us consider some examples of relativistic kinematics. 


15.1.1 Compton effect 


Imagine a photon of momentum K moving along the x-axis, bouncing off 
a stationary electron of momentum P, as shown in Figure 15.1. This pro- 
cess is called the Compton effect. We consider the one-dimensional version 
of this process, in which the final electron and photon are forced to travel 
along the x-axis. What is the energy of the outgoing photon? I will show 
you how to do this problem using units in which c = 1. This makes the 
manipulations easier. I will also show you how we can eventually reinstate 
factors of c guided by dimensions. With these units, the initial and final 
photon momenta look as follows: 


$$
K = (\omega, \omega), \qquad K' = (\omega', -\omega'). \tag{15.40}
$$

The electron’s four-momenta are, before and after,

$$
P = (m, 0), \qquad P' = (E', p') \quad \text{with } E'^2 - p'^2 = m^2. \tag{15.41}
$$

In preparation for what follows, let us compute some dot products of
four-vectors using

$$
A \cdot B = a_0 b_0 - a_1 b_1. \tag{15.42}
$$

[Figure 15.1 diagram. BEFORE: photon K = (k, k), electron P = (mc, 0). AFTER: photon K′ = (k′, −k′), electron P′ = (E′/c, p′).]

Figure 15.1 A photon bounces off an electron at rest. This is the
one-dimensional version of Compton scattering.
The following will be needed:

\begin{align}
K \cdot K' &= \omega\omega' - (\omega)(-\omega') = 2\omega\omega' \tag{15.43}\\
P \cdot K &= m\omega - 0 \cdot \omega = m\omega \tag{15.44}\\
P \cdot K' &= m\omega' - 0 \cdot (-\omega') = m\omega'. \tag{15.45}
\end{align}


Begin with the conservation law 


$$
K + P = K' + P'. \tag{15.46}
$$


Because no one cares about the details of P′, the scattered electron, we
isolate and square it (take the dot product with itself) because we know
that for any particle, no matter how it is moving, the square of the
four-momentum will be m²c² = m² (with c = 1). Here are the details.

\begin{align}
P' \cdot P' = (P')^2 = m^2 &= (P + K - K')^2 \tag{15.47}\\
&= P^2 + K^2 + K'^2 + 2(P \cdot K - P \cdot K' - K \cdot K') \tag{15.48}\\
&= m^2 + 0 + 0 + 2(m\omega - m\omega' - 2\omega\omega') \tag{15.49}\\
0 &= m(\omega - \omega') - 2\omega\omega' \tag{15.50}\\
\frac{1}{\omega'} &= \frac{1}{\omega} + \frac{2}{m} \tag{15.51}\\
\frac{1}{\omega'} &= \frac{1}{\omega} + \frac{2}{mc^2} \tag{15.52}
\end{align}

where, in the last equation, I restored the c² because ω is an energy and so
is mc². In the three-dimensional case, the photon can emerge at an angle
θ relative to the x-axis and we find

$$
\frac{1}{\omega'} = \frac{1}{\omega} + \frac{1 - \cos\theta}{mc^2}. \tag{15.53}
$$

(Equation 15.52 corresponds to θ = π.) Compton did the scattering and
confirmed this prediction, and this was very instrumental in convincing
the community of the reality of the photon as a particle.
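
If you want to see the numbers, here is a minimal Python sketch of the Compton formula in units with c = 1. The function names are mine, not from the lectures, and the value 0.511 MeV for the electron rest energy is the standard rounded figure. The second function treats only the head-on, one-dimensional case and checks that the recoiling electron still satisfies E′² − p′² = m².

```python
import math


def scattered_photon_energy(omega, m, theta=math.pi):
    """Outgoing photon energy: 1/omega' = 1/omega + (1 - cos(theta))/m, with c = 1."""
    return 1.0 / (1.0 / omega + (1.0 - math.cos(theta)) / m)


def recoil_electron(omega, m):
    """(E', p') of the electron for one-dimensional back-scattering (theta = pi)."""
    omega_prime = scattered_photon_energy(omega, m)
    energy = m + omega - omega_prime   # energy conservation
    momentum = omega + omega_prime     # the photon reverses, so the electron gets both
    return energy, momentum


m_electron = 0.511   # MeV, electron rest energy (rounded)
omega_in = 1.0       # MeV, an arbitrary illustrative photon energy
E, p = recoil_electron(omega_in, m_electron)
print(scattered_photon_energy(omega_in, m_electron))
print(E**2 - p**2, m_electron**2)    # the two numbers should agree
```

The agreement of the last two numbers is just Eqn. 15.50 in disguise: the formula for ω′ is exactly what keeps the electron on its mass shell.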


15.1.2 Pair production 


What is the minimum energy E of the incident proton that strikes the
proton at rest so that in the end we have a p, p, p, and p̄, that is, three protons
and an antiproton as in Figure 15.2? (Anti-particles have the same mass as
particles.)

The energy-momentum of the incident proton is P₁ = (E, p), that
of the target is P₂ = (m, 0). (Remember c = 1.) In the lab frame the total
momentum is

$$
P_{\text{Tot}} = P_1 + P_2 = (E + m, p). \tag{15.54}
$$


In the minimal reaction the final four particles will have the minimum
energy, but they can’t all be at rest due to momentum conservation.
So we go to the CM frame in which the two initial protons approach
each other with opposing spatial momenta; that is, their four-momenta
are (E_CM, p_CM) and (E_CM, −p_CM). Thus the initial total momentum is
(2E_CM, 0). The final four-momentum of the four particles, now allowed to
be created at rest, is

$$
P^{\text{CM}}_{\text{Tot}} = (4m, 0). \tag{15.55}
$$

[Figure 15.2 diagram. Panels: Lab, before; Lab, after; CM, before; CM, after.]

Figure 15.2 Top: The collision of a projectile proton and a static target proton,
leading to three protons and an antiproton produced with the minimum energy.
The final four particles cannot be at rest in the lab frame due to momentum
conservation. Bottom: The minimal collision as seen in the CM frame, where the
final four particles are allowed to be at rest.
Now recall that for any four-vector V, V · V is invariant, the same in
all frames. Applying this to the total momentum we find, for the minimum
energy process,

\begin{align*}
\text{CM frame:} \quad & P^{\text{CM}}_{\text{Tot}} \cdot P^{\text{CM}}_{\text{Tot}} = 16m^2\\
\text{Lab frame:} \quad & P_{\text{Tot}} \cdot P_{\text{Tot}} = E^2 + m^2 + 2mE - p^2 = m^2 + m^2 + 2Em.
\end{align*}

Equating the results of P_Tot · P_Tot computed in two frames gives the desired
result

$$
E = 7m = 7mc^2. \tag{15.56}
$$


If we thought only about energy, we would guess that the proton
coming out of the accelerator should have an energy E = 3mc², which,
along with the mc² of the target proton, would be able to produce the
final four particles at rest. But momentum has to be conserved, of course,
and the projectile has to have an energy 7mc², not 3mc², to pay for this
extra mandatory kinetic energy of the final four particles. The Bevatron was
constructed at the Lawrence Berkeley National Laboratory with the energy
per proton slightly above 7mc². Using it, Owen Chamberlain and Emilio
Segrè successfully produced the antiproton in 1955.

At the Large Hadron Collider at CERN (European Organization for
Nuclear Research), protons circulating one way collide with protons
circulating the other way at the same energy, so that the lab frame is also the CM
frame. Now all the beam energy goes to particle production. Each colliding
proton would have to have just E = 2mc² to make the final three protons
and an antiproton at rest. Of course the Large Hadron Collider is interested
in producing the Higgs boson and bigger fish and not the antiproton.
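
The same invariant argument gives the threshold for any final state. Here is a hedged Python sketch, again with c = 1 and with function names of my own choosing. It assumes the projectile and target have the same mass m and that the final particles, of total rest mass M, are produced at rest in the CM frame. Equating (E + m)² − p² with M² and using E² − p² = m² gives E = (M² − 2m²)/(2m).

```python
def threshold_fixed_target(m, total_final_mass):
    """Minimum total energy of the projectile hitting an equal-mass target at rest."""
    return (total_final_mass**2 - 2.0 * m**2) / (2.0 * m)


def threshold_collider(total_final_mass):
    """Minimum energy per beam when equal-mass beams collide head-on."""
    return total_final_mass / 2.0


m_proton = 0.938  # GeV, proton rest energy (rounded)
print(threshold_fixed_target(m_proton, 4 * m_proton) / m_proton)  # in units of m
print(threshold_collider(4 * m_proton) / m_proton)                # in units of m
```

With M = 4m the first function reproduces the 7m of Eqn. 15.56 and the second gives the 2m per proton quoted for colliding beams.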


15.1.3 Photon absorption 


Finally, consider the case of photon absorption. If an atom absorbs a
photon of energy ω, the atom will recoil, and its mass will increase due to
the absorbed energy. We want to find the new mass. This confuses some
students, who say, “But you said the mass of a particle is unaffected by its
momentum: even if E and p change with velocity, E² − c²p² = m²c⁴ always!
So what does it mean to compute the new mass of the atom?” As long as
a particle retains its identity, as the electron and photon do in Compton
scattering, it will preserve its mass, but that is not the case here: a photon
has disappeared and an “excited” atom has appeared. The atom in an
excited internal state is to be viewed as a different particle, with its mass m′ as
a free parameter. We want to find the value of m′. (The electron does not
seem to have internal states of different energy, which is why we did not
have this issue in Compton scattering.)

Your guess may be m′ = m + ω/c². Let us work it out, following
Figure 15.3, this time keeping the explicit factors of c and also writing
down the conservation law for each component of the four-momentum.


Let

\begin{align}
P &= (mc, 0) \tag{15.57}\\
K &= \left(\frac{\omega}{c}, \frac{\omega}{c}\right) = (k, k) \tag{15.58}\\
P' &= \left(\frac{m'c}{\sqrt{1 - v^2/c^2}}, \frac{m'v}{\sqrt{1 - v^2/c^2}}\right) \tag{15.59}
\end{align}

be the four-momenta of the initial atom at rest, the incoming photon, and
the final atom respectively. The conservation law for four-momentum or
energy-momentum is

$$
P + K = P'. \tag{15.60}
$$

[Figure 15.3 diagram. BEFORE: photon K = (k, k), atom P = (mc, 0). AFTER: atom P′ = (E′/c, p′).]

Figure 15.3 A photon is absorbed by an atom of mass m, which recoils and goes
into an excited state. It is viewed as a different particle with a somewhat bigger
mass m′.
In terms of components:

\begin{align}
mc + \frac{\omega}{c} &= \frac{m'c}{\sqrt{1 - v^2/c^2}} && \text{conservation of energy, or of } E/c \tag{15.61}\\
0 + k &= \frac{m'v}{\sqrt{1 - v^2/c^2}} && \text{momentum conservation.} \tag{15.62}
\end{align}


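Now you can juggle these equations and solve for m′, but again there
is a quicker way using four-vectors. Because we just want m′, we need only
calculate P′ · P′ = m′²c² as follows:

\begin{align}
m'^2c^2 &= P' \cdot P' \tag{15.63}\\
&= (P + K) \cdot (P + K) \tag{15.64}\\
&= P \cdot P + K \cdot K + 2P \cdot K \tag{15.65}\\
&= m^2c^2 + 0 + 2\left(mc \cdot \frac{\omega}{c} - 0 \cdot \frac{\omega}{c}\right) \tag{15.66}\\
m' &= \sqrt{m^2 + \frac{2m\omega}{c^2}}. \tag{15.67}
\end{align}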


We can approximate as follows for small ω/(mc²):

$$
m' = m\sqrt{1 + \frac{2\omega}{mc^2}} = m\left(1 + \frac{\omega}{mc^2} + \dots\right) = m + \frac{\omega}{c^2} + \dots \tag{15.68}
$$

To this order the result is in accord with the naive expectation that ignores
recoil. The terms hidden in the ellipsis lower m′ slightly. In other words, not
all the photon energy can go into boosting the atom’s rest mass, because
it also needs to move to conserve the initial photon momentum. Thus the
increased rest energy plus kinetic energy has to equal the photon energy.
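
A quick numerical check makes the point about recoil concrete. This Python sketch is my own addition, with c = 1 and invented function names. It evaluates the exact m′ of Eqn. 15.67, compares it with the naive guess m + ω, and confirms that the moving excited atom carries exactly the energy m + ω.

```python
import math


def excited_mass(m, omega):
    """Exact mass of the excited atom, Eqn. 15.67 with c = 1."""
    return math.sqrt(m * m + 2.0 * m * omega)


def recoil_velocity(m, omega):
    """Velocity of the final atom: total momentum omega over total energy m + omega."""
    return omega / (m + omega)


m, omega = 1.0, 0.01
m_new = excited_mass(m, omega)
v = recoil_velocity(m, omega)
print(m_new, m + omega)                          # exact mass is slightly below the naive guess
print(m_new / math.sqrt(1.0 - v * v), m + omega) # total energy matches m + omega
```

The small shortfall of m′ relative to m + ω is the kinetic energy the atom must carry in order to conserve momentum.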

Remember the following tricks when you deal with relativistic
collisions: (1) In the four-vector equations square that four-momentum about
which you know the least, because the answer for the square is always m²c².
(2) Sometimes the momentum you need to square may not be standing
alone on one side of the equation. If this happens, isolate it (by moving
other terms to the other side) and square it.


CHAPTER 16 


Mathematical Methods 


16.1 Taylor series of a function 


I am going to introduce you to some mathematical tricks. As you’ve prob- 
ably noticed by now, a lot of physics has to do with mathematics, and if 
you’re not good in math, you’re not going to be good in physics.

The first important trick is called the Taylor series. The philosophy of 
the Taylor series is the following: There is some function f(x) depicted in 
Figure 16.1. But I’m going to imagine that you can only zero in on a tiny
region near x = 0. And the question is, how will you write an approxima- 
tion for this whole function, valid away from x = 0? Suppose I don’t show 
you anything except what’s happening at x = 0; I show you only f (0). The 
value of the function is 92. What should you do away from x = 0? You have 
no additional information about this function; you don’t know if it’s going 
up or going down. The best approximation you can make is the flat line in 
Figure 16.1. There’s no reason to tilt it one way or the other, given the in- 
formation you have. If you do not pick the constant to be 92 you will even 
miss the answer at the one place (x = 0) where you were given the value. 
[Figure 16.1 plot: the curve f(x), the flat line f(x) = f(0), and the straight line f(x) = f(0) + f′(0)x.]


Figure 16.1 A sample function f(x) and two approximations to it near x = 0 
based on the Taylor series. The flat line f (x) = f (0) assumes f(x) does not change 
as we move away from x = 0. It is based on the first term in what is called the 
Taylor series. The second line f (x) = f (0) + f’(0)x is the linear approximation 
based on two terms, and it matches the function and its derivative at x = 0. It 
assumes the slope does not change. 


So, the first approximation of the function you will say is 


$$
f(x) = f(0) + \dots \tag{16.1}
$$


where the ellipsis means there may be corrections away from x = 0 but
you do not know them. The left-hand side stands for the actual function
and the right-hand side for an approximation that matches the real thing at x = 0
with possible unknown corrections as we move away. If it turns out that
f(x) is really constant, the horizontal line in Figure 16.1 will be the exact 
representation of the function. 

To set f(x) = f (0) is like saying “The temperature today (x = 0) is 
92. I don’t know anything else, so my best guess away from x = 0 would be 
92, shown by the horizontal dotted line f(x) = 92.” But if you know that 
this is summer and that the temperature is going up from day to day at 
a known rate, and somebody tells you, “I know that the rate of change of
temperature is (df/dx)|₀ today,” you can use that information to predict what
the temperature will be tomorrow or what it must have been yesterday.
That is to say, for x ≠ 0 you can improve our approximation to f using
your knowledge of its derivative at x = 0:

\begin{align}
f(x) &= f(0) + \left.\frac{df}{dx}\right|_0 x + \dots \tag{16.2}\\
&= f(0) + f'(0) \cdot x + \dots \tag{16.3}
\end{align}

where I am using another popular notation

$$
f'(x) = \frac{df}{dx}, \qquad f''(x) = \frac{d^2f}{dx^2}, \quad \dots
$$



where the number of primes denotes the number of derivatives. This nota- 
tion is convenient if we do not plan on taking too many derivatives. Since 
f'(0) is the derivative at x = 0, when you multiply it by x you get the best 
guess for the change as we go away from x = 0. What we are approxi- 
mating the function by is a straight line with the correct intercept and the 
correct slope, shown in Figure 16.1. If it turns out that the function really 
is a straight line, you are done. It’s not even an approximation; it will track 
the function all the way to infinity. But it can happen, of course, that the 
function decides to curve upward, as I’ve shown in this example, and this 
linear approximation will not work if you go too far. For a while, you'll be 
tangent to the function, but then it will bend away from you. So, it’s really 
good for a very small x and you can say, “Well, I want to do a little better 
when I go further out.” 

This approximation ignores the rate of change of the rate of change. 
The linear approximation assumes that the rate of change is fixed at the 
rate of change at the origin. And the rate of change of the rate of change is 
the second derivative f” (0). Suppose you were given that as well. How do 
you use it? The answer is 


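$$
f(x) = f(0) + \left.\frac{df}{dx}\right|_0 x + \frac{1}{2!}\left.\frac{d^2f}{dx^2}\right|_0 x^2 + \dots \tag{16.4}
$$

$$
f(x) = f(0) + f'(0) \cdot x + f''(0) \cdot \frac{x^2}{2!} + \dots \tag{16.5}
$$

Why should you divide by 2!? Because our goal is to generate an
approximation that has whatever you know about the function built into it and the 1/2!
ensures it has the right second derivative at x = 0. Let’s check that and the
other features as well.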

First of all, let’s compare the two sides of Eqn. 16.5 at x = 0. The
left-hand side is f (0). On the right-hand side, when you put x = 0, you kill the
x and x² terms and are left with f (0), and that matches the left-hand side.
So, you certainly have the right value of the function here.

Then you say, “What if I take the derivative of the function and
compare at x = 0?” Let’s first take the derivative of both sides of Eqn. 16.5.
We find

$$
f'(x) = 0 + f'(0) + f''(0)\,x + \dots \tag{16.6}
$$

The left-hand side is f′(x). When we take the derivative of the right-hand
side of Eqn. 16.5, the first term f(0), being constant, gives a 0. The
derivative of the x in the second term is 1, and finally the derivative of x² is 2x,
so I get f″(0) times x. Now we evaluate the derivative at x = 0. That kills
the term with a residual x, and the derivative of my approximate function
matches the derivative of the actual function.

How about the second derivative of this function? Take two
derivatives on the left-hand side of Eqn. 16.5 to get f″(x) on the left. See what
happens on the right-hand side. If you take two derivatives, the only
survivor is the x² term, which leaves f″(0) with a coefficient 1. Now, if you set
x = 0, the left becomes f″(0) and so does the right. Thus with the 1/2!, the
function you have cooked up has the right value, the right slope, and the
right rate of change of the slope at the origin.

It’s very clear what you should do to go beyond. If you knew more 
derivatives, the approximation you would write would be 


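$$
f(x) = f(0) + \left.\frac{df}{dx}\right|_0 x + \frac{1}{2!}\left.\frac{d^2f}{dx^2}\right|_0 x^2 + \frac{1}{3!}\left.\frac{d^3f}{dx^3}\right|_0 x^3 + \dots \tag{16.7}
$$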


And you go as far as you can. If you know 13 derivatives, put in the 13 
before you surrender to the dot-dots. That’s still an approximation, unless 
the function happened to be a polynomial of degree 13. 

Now, sometimes you hit the jackpot, and you know all the deriva- 
tives. If someone tells you all the derivatives of the function, then why stop? 
Add them all up: 


$$
f(x) = \sum_{n=0}^{\infty} \frac{1}{n!}\left.\frac{d^nf}{dx^n}\right|_0 x^n \tag{16.8}
$$

where the n = 0 term is just f (0).

And if you do the infinite summation at every value of x, and, if that 
summation is meaningful and gives you a finite number, that, in fact, is 
exactly the function you were given. That is the Taylor series: an infinite 
number of terms that, if they sum up to something finite, will actually be 
as good as the left-hand side. 


Here’s a famous example: 


$$
f(x) = \frac{1}{1 - x}. \tag{16.9}
$$


You and I know this function; we know how to put it in a calculator; we 
know how to plot it. You give me an x and I subtract it from 1 in the 
denominator, and that’s the value of the function. But instead, suppose 
this function was revealed to us in stages. Say we were told just f (0). What 
is f (0) here? Set x = 0 and get f (0) = 1. Now, let’s take the derivative of this
function,

$$
\frac{df}{dx} = \frac{1}{(1 - x)^2} \tag{16.10}
$$


and evaluate it at x = 0, to get f’(0) = 1. 

If you now take the second derivative of this function, which I don’t 
feel like doing, and evaluate that at x = 0, you will find it is equal to 2!. In 
fact, the n-th derivative at the origin is n!. That’s very nice, because then 
the Taylor series becomes 


$$
f(x) = 1 + x + x^2 + x^3 + \dots = \sum_{n=0}^{\infty} x^n. \tag{16.11}
$$


The Taylor series is this infinite sum. In practice, you may be happy to just 
keep a couple of terms. 

So, let’s get a feeling for what those couple of terms can do for us.
Let me take x = 0.1; the real answer is 1/0.9 = 1.1111..., where the 1’s go
on forever. That’s the target value. What do you get with the series? The
series starts at 1, plus one-tenth, plus 1 over 100, plus 1 over 1000, and
so on. And you can see, as I keep more and more terms, I keep just
filling up these 1’s. If you stop at 1 over 1000, you stop right there, at 1.111.
I hope it is clear to you, perhaps from this simple example, that if you
kept all the terms of the series, you really will get this infinite number of
recurring 1’s.
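
You can watch the 1’s fill in with a few lines of Python. This is a minimal sketch I am adding; the function name is hypothetical, and the arithmetic is ordinary floating point, so the printed digits are only approximately those of the exact fractions.

```python
def geometric_partial_sum(x, n_terms):
    """Sum of the first n_terms of 1 + x + x^2 + ..."""
    total, term = 0.0, 1.0
    for _ in range(n_terms):
        total += term
        term *= x
    return total


x = 0.1
for n_terms in (1, 2, 3, 4, 8):
    print(n_terms, geometric_partial_sum(x, n_terms), 1.0 / (1.0 - x))
```

Each extra term adds one more 1 after the decimal point, and the gap to 1/(1 − x) shrinks by a factor of ten.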

Consider the infinite Taylor series. Now, summing an infinite
number of numbers is a delicate issue. I don’t want to go there at all; I discuss
that in my math book. But here are some caveats. Sometimes a sum makes
no sense, and you have to quit. For example, put x = 2. The correct
function is 1/(1 − 2), which is −1. Our approximation for x = 2 looks like 1 + 2 + 4 +
8 + .... First of all, this sum is going to grow to infinity, because the numbers
are getting bigger and bigger. This sum seems to be all positive, while the
correct answer is negative. Obviously, the series doesn’t work. So, the next
lesson of our Taylor series is this: you can write down the series, but it may
not sum up to anything sensible beyond a certain distance from the
starting point. So, if you’re doing a Taylor series at x = 0, and you go to x = 2,
you may find, as we did above, that it just doesn’t work. So, you can ask,
“How far can I go from the origin?” Well, in this simple example, we know
that at x = 1 the function is going to infinity; that’s why you couldn’t go
there or past that to the right of that point. The function is well defined on
the other side for x > 1, but this series, this knowledge of the function and
all its derivatives at the origin, is not enough to get you to the other side.
So, this is a case where there are obvious problems at x = 1. It turns out that
you also cannot reach or go to the left of x = −1, even though the function
has no obvious problem there. Or consider a function like 1/(1 + x²), with
no evident troubles for any x. And yet, if you take the Taylor series for it,
you will find that if you go beyond |x| = 1 the series blows up. (This
problem arises because the function blows up at x = ±i. Here you really need to
look in the complex plane to see how far you can go before the series breaks
down.)
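
Here is a small Python sketch of that last example. It is my own illustration with a made-up function name. It assumes the series of 1/(1 + x²) about the origin is 1 − x² + x⁴ − ..., which is the geometric series with x replaced by −x².

```python
def partial_sums(x, n_terms):
    """Running partial sums of 1 - x^2 + x^4 - ..., the series of 1/(1 + x^2) about 0."""
    sums, total, term = [], 0.0, 1.0
    for _ in range(n_terms):
        total += term
        sums.append(total)
        term *= -x * x
    return sums


for x in (0.5, 1.5):
    sums = partial_sums(x, 20)
    # Compare the last few partial sums with the true value of the function.
    print(x, sums[-3:], 1.0 / (1.0 + x * x))
```

Inside |x| < 1 the partial sums settle down on the function; outside, they swing between ever larger positive and negative values even though the function itself is perfectly tame there.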

I don’t want to go into the mathematical theory of series. I just want 
to tell you that functions can be approximated by series. And, if you’re
lucky, you can get by with a few terms. If you are luckier and know all 
the derivatives, the whole sum may converge to give a finite answer within 
some interval, in which case, the series is as good as the function. One 
person can use 1/(1 — x), the other one can use the infinite series, and 
both are morally and mathematically equal in every sense, as long as they 
don’t stray outside the region of validity of the infinite series. 
Finally, note an obvious generalization: if f and its derivatives are
given at x = a and not at x = 0, by the same logic, we may write

$$
f(x) = \sum_{n=0}^{\infty} \left.\frac{d^nf}{dx^n}\right|_a \frac{(x - a)^n}{n!} \tag{16.12}
$$

where the n = 0 term is f (a).


16.2 Examples and issues with the Taylor series 


Now let’s return to a popular example that I’ve been using a lot, (1 + x)^n.
That’s a function for which we want the Taylor series. We see f (0) is 1.
What’s the derivative of the function? It’s n(1 + x)^(n−1), which reduces at
x = 0 to just n. That’s how we get the famous result


$$
(1 + x)^n = 1 + nx + \dots \tag{16.13}
$$


If x is small enough, you stop there, because the next term is going to
involve an x² and then an x³ will follow and so on. If x is tiny, we have no
respect for big powers of x; we just cut them off. But if we want the next
term, we’ll have to take the second derivative

$$
\frac{d^2(1 + x)^n}{dx^2} = n(n - 1)(1 + x)^{n-2}, \tag{16.14}
$$

which has a value n(n − 1) at x = 0 so that now

$$
(1 + x)^n = 1 + nx + \frac{n(n - 1)}{2!}x^2 + \dots \tag{16.15}
$$


What happens if we consider the case when n is a positive integer like
2? After all, we know from the days in the nursery that

$$
(1 + x)^2 = 1 + 2x + x^2, \tag{16.16}
$$

while the series above gives

$$
(1 + x)^2 = 1 + 2x + \frac{2(2 - 1)}{2!}x^2 + \dots \tag{16.17}
$$

What are the ellipses doing there when we have the whole answer already?
Luckily, the next term in the series will have a factor 2(2 − 1)(2 − 2) that
vanishes, as will all subsequent terms which will also contain the (2 − 2)
factor. By this argument, the expansion will terminate after the x^n term if n
is a positive integer. For other powers, fractional or negative, this will not happen.
(I suggest you play with n = −1.)
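
If you want to play without doing the derivatives by hand, here is a hedged Python sketch. The function name is hypothetical. It assumes the general pattern suggested by Eqn. 16.15, namely that the coefficient of x^k is n(n − 1)...(n − k + 1)/k!, and it uses exact fractions so that a vanishing coefficient shows up as exactly 0.

```python
from fractions import Fraction


def binomial_series_coefficients(n, count):
    """First `count` Taylor coefficients of (1 + x)^n about x = 0."""
    coeffs = []
    c = Fraction(1)
    for k in range(count):
        coeffs.append(c)
        # Next coefficient: multiply by (n - k) and divide by (k + 1).
        c = c * (Fraction(n) - k) / (k + 1)
    return coeffs


print(binomial_series_coefficients(2, 5))               # terminates after x^2
print(binomial_series_coefficients(-1, 5))              # never terminates
print(binomial_series_coefficients(Fraction(1, 2), 4))  # square root
```

For n = 2 every coefficient past x² is zero, while for n = −1 the coefficients alternate in sign forever.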

Now for an example from relativity. The energy of a particle is 


$$
E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} = mc^2\left(1 - \frac{v^2}{c^2}\right)^{-1/2} = mc^2 + \frac{1}{2}mv^2 + \dots \tag{16.18}
$$


We never knew the first term was there (it cancels out of all equations
where we balance kinetic energy). We just kept the first nontrivial term
½mv² and did mechanics this way for three hundred years. So, the
approximations can really be useful. If you say, “Well, I want to be exact,” you can
go back and use the full √(1 − v²/c²). Unfortunately, somebody or other
will tell you, “That’s not exact either. There is quantum mechanics, which 
tells you the whole thing is wrong, that particles do not move on trajecto- 
ries.” I have a lot of respect for approximations. If we could not describe 
the world approximately, we could not have accomplished what we have. 
No one knows the exact answer to a single question you can pose: if your 
question requires an answer to arbitrary precision, we just cannot pro- 
vide it. Sometimes we do not even know if the question is meaningful in 
a more advanced theory, as was the case with trajectories when we moved 
from Newtonian to quantum mechanics. Newtonian mechanics works for 
small velocities. Relativistic mechanics works for any velocity, but not for 
really tiny objects for which you have to use quantum mechanics and then 
relativistic quantum mechanics. If we bring in gravity we need general rel- 
ativity. So, as old theories always give way to new theories, approximations 
are very important as we inch forward. 
16.3 Taylor series of some popular functions


Now I’m going to consider the following function: e^x. This is one you all
know and love. Every child knows e^x is its own derivative. This means every
derivative of e^x is e^x. Why do we like that here? Because all the derivatives
at the origin are known and equal just 1. It follows that

$$
e^x = \sum_{n=0}^{\infty} \frac{x^n}{n!}. \tag{16.19}
$$

Now I need to know the value of e, because when I lock my suitcase
and check it at the airport, I use either e or π, because they’re the only two
numbers I can derive, as compared to my anniversary. So, if I forget the
value of e, I just say

$$
e = e^1 = 1 + 1 + \frac{1}{2!} + \frac{1}{3!} + \dots \tag{16.20}
$$

and keep finding terms till my suitcase opens. It is roughly 2.718, which
is enough for most locks. Now, π is a good number too, but the rules for
computing its digits are somewhat more difficult.

Here is the very nice property of the exponential series: It is good for 
any x, unlike the series for 1/(1 — x) that crashed and burned at |x| = 1. 
This series is always good. You put x = 37 million, you have 1 plus 37 
million, half of 37 million squared, a sixth of 37 million cubed, and so on; 
but don’t worry, these factorials downstairs will eventually tame it down 
and make it converge, and the result will give you e to whatever number 
you chose. That’s something that I’m not proving.
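
Short of a proof, you can at least watch the factorials win. This is a minimal Python sketch of my own, with a made-up function name. It sums the series of Eqn. 16.19 term by term and compares with the library exponential. I use x = 10 rather than 37 million so that the numbers fit comfortably in floating point.

```python
import math


def exp_series(x, n_terms):
    """Sum of the first n_terms of 1 + x + x^2/2! + x^3/3! + ..."""
    total, term = 0.0, 1.0
    for n in range(n_terms):
        total += term
        term *= x / (n + 1)   # turns x^n/n! into x^(n+1)/(n+1)!
    return total


for n_terms in (1, 2, 3, 5, 10, 20):
    print(n_terms, exp_series(1.0, n_terms))   # creeps up on e

print(exp_series(10.0, 80), math.exp(10.0))
```

For x = 10 the individual terms first grow into the thousands before the factorials take over and the sum settles down.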

We all see that Eqn. 16.19 defines a function of $x$, but why call it 
$e$ raised to a power $x$? Is the $x$ in the series really a power? You know that 
when integer powers are involved, when you raise a number to a power and 
multiply it by the number to a different power, the product is a number 
to the sum of the two powers. Powers combine. Thus $2^2 \cdot 2^9 = 2^{11}$. That’s 
true for 2 or any number raised to an integer power; but how do we know 
this is true for $e^x$, especially when $x$ is not an integer? Let us verify this 
property by multiplying the series for $e^x$ and $e^y$ to see if we get the series 
for $e^{x+y}$, working out the first few powers until we are convinced: 


$$e^x e^y = \left(1 + x + \frac{x^2}{2} + \cdots\right)\left(1 + y + \frac{y^2}{2} + \cdots\right) \tag{16.21}$$
$$= 1 + \underbrace{x + y}_{\text{linear}} + \underbrace{\frac{x^2}{2} + \frac{y^2}{2} + xy}_{\text{quadratic}} + \cdots \tag{16.22}$$
$$= 1 + (x + y) + \frac{(x + y)^2}{2} + \cdots \tag{16.23}$$
$$= e^{x+y}. \tag{16.24}$$


The proof carries over even if x and y are replaced by complex numbers, a 
result we will be using shortly. 

Now look at $\cos x$. What do I need to know to write the series for 
it at $x = 0$? We know the cosine of 0 is 1. If you take the derivative you 
get $-\sin x$, and its value at 0 is 0. You take one more derivative and get 
$-\cos x$, which is $-1$ at the origin, and so on. Every other derivative will 
vanish, and the surviving derivatives will alternate between $\pm 1$ to give 


$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots \tag{16.25}$$
$$= \sum_{n=0}^{\infty} (-1)^n \frac{x^{2n}}{(2n)!}. \tag{16.26}$$


It is clear from the series that $\cos x$ is an even function: $\cos x = \cos(-x)$ 
because only even powers of $x$ appear. Near $x = 0$, it starts at 1 
and falls down quadratically. So a very useful approximation is 


$$\cos x \approx 1 - \frac{x^2}{2} + \cdots \tag{16.27}$$


It is not clear from the series that $\cos x$ is bounded. If you cut it off 
after some number of terms, it’s not going to work. In the beginning, $1 - \frac{x^2}{2}$ 
looks very good as the cosine starts coming down from 1. But eventually 
this approximation will go bad on you and become too negative, but then 
the next term $\frac{x^4}{4!}$ will turn it around but soon drive the answer to values that 
are too positive, and so on. If, however, you add all the powers, remarkably, 
you will reproduce this nice function that will oscillate with period $2\pi$. It 
is very hard to imagine that the series in Eqn. 16.26 is actually the cosine 
with all its properties, but it is. 
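
To see the truncated series overshoot in alternating directions before settling down, here is a small Python sketch (an illustration, not from the lecture; the name `cos_series` and the test point $x = 5$ are arbitrary choices).

```python
import math

def cos_series(x, terms):
    """Sum of the first `terms` terms of 1 - x^2/2! + x^4/4! - ..."""
    total = 0.0
    term = 1.0  # the n = 0 term
    for n in range(terms):
        total += term
        # next term: multiply by -x^2 / ((2n+1)(2n+2))
        term *= -x * x / ((2 * n + 1) * (2 * n + 2))
    return total

x = 5.0
for terms in (2, 3, 4, 6, 10, 20):
    print(terms, cos_series(x, terms))  # too negative, then too positive, ...
print("math.cos:", math.cos(x))

# With enough terms the sum also repeats after a shift by 2*pi.
print(cos_series(1.0, 40), cos_series(1.0 + 2 * math.pi, 40))
```

With only a few terms the partial sums at $x = 5$ lie far outside the range $[-1, 1]$; with enough terms they agree with the library cosine.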

By similar reasoning, 


$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots \tag{16.28}$$
$$= \sum_{n=0}^{\infty} (-1)^n \frac{x^{2n+1}}{(2n+1)!}. \tag{16.29}$$


It is obvious that this is an odd function, $\sin(-x) = -\sin x$, but not 
obvious that it is bounded or periodic. 

The series for $\sin x$ and $\cos x$ converge for all finite values of $x$, just 
like the one for $e^x$. 

If you are stranded and bored at some airport, take cosine squared 
plus sine squared using the series and group terms with the same power 
of $x$. You'll first find a 1 from squaring the 1 in the series for $\cos x$, and the net 
coefficient of all non-zero powers of x will miraculously vanish. 
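
If no airport is handy, the bookkeeping can be delegated to a computer. The sketch below (Python, with exact fractions; the cutoff `N = 12` and all names are illustrative assumptions) squares the two truncated series and adds them, keeping only powers up to $x^N$, where the truncation does not affect the coefficients.

```python
from fractions import Fraction
from math import factorial

N = 12  # keep powers of x up to x^N (an arbitrary cutoff for this sketch)

# Coefficient lists: entry k is the coefficient of x^k.
cos_c = [Fraction((-1) ** (k // 2), factorial(k)) if k % 2 == 0 else Fraction(0)
         for k in range(N + 1)]
sin_c = [Fraction((-1) ** ((k - 1) // 2), factorial(k)) if k % 2 == 1 else Fraction(0)
         for k in range(N + 1)]

def square(c):
    """Coefficients of the square of a power series, truncated at x^N."""
    out = [Fraction(0)] * (N + 1)
    for i, a in enumerate(c):
        for j, b in enumerate(c):
            if i + j <= N:
                out[i + j] += a * b
    return out

# Coefficients of cos^2 x + sin^2 x, power by power.
total = [a + b for a, b in zip(square(cos_c), square(sin_c))]
print(total)
```

Because the arithmetic is exact, the coefficients of the non-zero powers come out as exact zeros rather than small rounding errors.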


16.4 Trigonometric and exponential functions 


Let us introduce, without any preamble right now, the number $i = \sqrt{-1}$. 
The only property I need is that $i^2 = -1$, $i^3 = -i$, $i^4 = +1$, $i^5 = i$, and so on. 
Consider the following rather strange object, $e^{ix}$. Now $e$ is some number, 
and, if you want to raise it to a power, say 2, that’s fine. But now we want 
to raise $e$ to a complex power, $ix$. What does that even mean? Multiply $e$ by 
itself $ix$ times? Well, that definition of powers is no good. But the series for 
$e^x$ defines it for all $x$, and we boldly define $e^x$ for even complex values of 
the exponent to be the same series with $ix$ in place of $x$. So, the exponential 
function is simply defined by the power series and not the notion of raising 
$e$ to some power. Thus 


$$e^{\text{dog}} = 1 + \text{dog} + \frac{\text{dog}^2}{2!} + \cdots \tag{16.30}$$


This way we can raise e to various things: real numbers, complex num- 
bers, matrices, whatever you want. If you have a pet, you can put that up 
in the exponent. Of course, you've got to be careful. You cannot raise e 
to dog, because the units don’t match: there’s a dog here and dog$^2$ there, 
and so on. So you should divide it by some standard dog, like President 
Obama’s dog. Take some standard and divide by it; then you have 
something dimensionless like this: 


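$$e^{\text{dog}/\text{standard dog}} = \sum_{n=0}^{\infty} \frac{1}{n!}\left(\frac{\text{dog}}{\text{standard dog}}\right)^n, \tag{16.31}$$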


which converges for dogs of any size. 
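That’s a fantastic leap of imagination. Let’s consider the series for $e^{ix}$: 


$$e^{ix} = \sum_{n=0}^{\infty} \frac{(ix)^n}{n!}. \tag{16.32}$$


Using what we know about the powers of $i$ we get the following terms: 


$$e^{ix} = 1 + ix + \frac{i^2 x^2}{2!} + \frac{i^3 x^3}{3!} + \frac{i^4 x^4}{4!} + \cdots \tag{16.33}$$
$$= \left[1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots\right] + i\left[x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots\right], \tag{16.34}$$

which leads to the following all-time hit due to Leonhard Euler: 

$$e^{ix} = \cos x + i\sin x. \tag{16.35}$$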


This is a super-duper formula worth memorizing. Life as we know 
it cannot go on without this formula. It says that the trigonometric and 
exponential functions are very intimately connected by the power series. A 
particularly beautiful case of this formula follows if you put $x = \pi$: 


$$e^{i\pi} + 1 = 0. \tag{16.36}$$


Everybody agrees this has to be one of the most beautiful formulas we can 
imagine, involving all the key numbers in mathematics: $\pi$, defined from 
ancient times as the ratio of a circle’s circumference to its diameter; $i$, the mother of all complex 
numbers; $e$, the base for the logarithm; and finally 0 and 1, from which we 
can build all numbers in binary. 
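
Euler's formula can be checked directly from the definition by feeding an imaginary argument into the power series. This is a minimal Python sketch; the helper `exp_series`, the 40-term cutoff, and the sample value $x = 1.2$ are assumptions made for illustration.

```python
import math

def exp_series(z, terms=40):
    """Partial sum of z^n / n!, with z allowed to be complex."""
    total = 0j
    term = 1 + 0j  # the n = 0 term
    for n in range(terms):
        total += term
        term *= z / (n + 1)
    return total

x = 1.2  # any real number will do
lhs = exp_series(1j * x)                  # the series with ix in place of x
rhs = complex(math.cos(x), math.sin(x))   # cos x + i sin x
print(lhs, rhs)

euler_identity = exp_series(1j * math.pi) + 1  # should vanish up to rounding
print(abs(euler_identity))
```

Python's built-in `complex` type does the powers of $i$ for us, so the real and imaginary brackets of the series separate automatically.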
Now let’s do the following two other variations, and then we'll move 
on. If I change $x$ to $-x$ in Eqn. 16.35, I get 


$$e^{-ix} = \cos x - i\sin x, \tag{16.37}$$


using the even/odd nature of $\cos x$ and $\sin x$. I now combine Eqns. 16.35 
and 16.37 to obtain 


$$\cos x = \frac{e^{ix} + e^{-ix}}{2} \tag{16.38}$$
$$\sin x = \frac{e^{ix} - e^{-ix}}{2i}. \tag{16.39}$$


This means that if you have exponential functions, you can manufacture 
trigonometric functions out of them, provided you’re not afraid to 
use complex exponents. And all the identities about sines and cosines will 
follow from this. For example, if you take cosine squared plus sine squared, 
you’re supposed to get 1. Well, you can square the right-hand sides of Eqns. 
16.38 and 16.39 and add them and you will get 1 provided you remember 
$e^{ix} e^{-ix} = e^{i(x-x)} = e^0 = 1$. 


16.5 Properties of complex numbers 


Now I’m going to do a little more with complex numbers. I introduced you 
to $i$ by saying it’s the square root of minus 1. Complex numbers entered 
our life even though we didn’t go looking for them. You can write down 
equations with real numbers with no intention of invoking anything fancy, 
like this: 

$$z^2 + 1 = 0, \tag{16.40}$$


and you find there is no real solution to this equation. You can say, “I want $z^2 = -1$,” 
and you can manufacture a number $i$ with the property $i^2 = -1$; and 
then, of course, you can have $z = \pm i$ as your answer. So, complex numbers 
arose first in attempts to solve quadratic equations with real coefficients. 
Let me write you a slightly more interesting quadratic equation, 


$$z^2 + z + 1 = 0. \tag{16.41}$$
Recall the answer 


$$z_\pm = \frac{-1 \pm \sqrt{-3}}{2}. \tag{16.42}$$

But we don’t know what to do with $\sqrt{-3}$, so we will write it as $\sqrt{-1} \cdot \sqrt{3}$ 
and end up with 


$$z_\pm = -\frac{1}{2} \pm \frac{\sqrt{3}}{2}\, i. \tag{16.43}$$


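These are formally solutions to Eqn. 16.41 in the following sense. 
Take one of the roots, say the one with the + sign; put that into Eqn. 16.41, 
and it will work: 


$$z^2 + z + 1 = \left(-\frac{1}{2} + \frac{\sqrt{3}}{2}\, i\right)^2 + \left(-\frac{1}{2} + \frac{\sqrt{3}}{2}\, i\right) + 1 \tag{16.44}$$
$$= \left(\frac{1}{4} - \frac{\sqrt{3}}{2}\, i - \frac{3}{4}\right) + \left(-\frac{1}{2} + \frac{\sqrt{3}}{2}\, i\right) + 1 \tag{16.45}$$
$$= \frac{1}{4} - \frac{3}{4} - \frac{1}{2} + 1 = 0. \tag{16.46}$$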


All you have to know in these manipulations is that $i^2 = -1$. Using this one 
property, you can now solve any quadratic equation. People realized that 
if we enlarge numbers to include complex numbers, then we can solve any 
n-th order polynomial equation and obtain n roots. If it’s quadratic, it'll 
have two roots; if it’s cubic, it'll have three roots. The roots may be complex 
even if the coefficients in the equation are all real. Because complex 
numbers arose from equations with real coefficients, will the equations 
with complex coefficients perhaps lead to even crazier numbers? Luckily 
or unluckily, this is not so: a complex polynomial equation of degree n 
with complex coefficients will have n roots, generally complex. 
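
As a quick check of the roots in Eqn. 16.43, the quadratic formula can be evaluated with complex arithmetic. This is a Python sketch, not from the lecture; `quadratic_roots` is a hypothetical helper name.

```python
import cmath

def quadratic_roots(a, b, c):
    """Both roots of a z^2 + b z + c = 0, allowing a negative discriminant."""
    disc = cmath.sqrt(b * b - 4 * a * c)  # cmath.sqrt returns a complex number
    return (-b + disc) / (2 * a), (-b - disc) / (2 * a)

z_plus, z_minus = quadratic_roots(1, 1, 1)  # z^2 + z + 1 = 0
for z in (z_plus, z_minus):
    # Substitute the root back in; the result should vanish up to rounding.
    print(z, abs(z * z + z + 1))
```

The only ingredient beyond real arithmetic is a square root that accepts negative inputs, which is exactly what the rule $i^2 = -1$ provides.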
Now, a very important point to notice is that this whole thing, $z = -\frac{1}{2} + \frac{\sqrt{3}}{2}\, i$, 
is a single complex number. Don’t think of it as the sum of two 
numbers; it cannot be simplified any further, just as a two-dimensional 
vector $\mathbf{V} = 2\mathbf{i} + 3\mathbf{j}$ cannot be simplified further. 
Figure 16.2 The complex plane with a typical point $z = x + iy$ and its conjugate 
$z^* = x - iy$. In polar form, $z$ is assigned a length $r$ and an angle $\theta$. We will see that 
$z = re^{i\theta}$. Note that we measure $y$ and not $iy$ along the y-axis. 


We are going to generalize this particular case and introduce now a 
complex number z as follows: 


$$z = x + iy. \tag{16.47}$$


Just as $x$ stood for a generic real number, $z$ stands for a generic complex 
number. However, $z$ has two parts: $x$, called the real part, and $y$, called 
the imaginary part. In the root of the quadratic equation $-\frac{1}{2} + \frac{\sqrt{3}}{2}\, i$, the 
real part is $-\frac{1}{2}$ and the imaginary part is $\frac{\sqrt{3}}{2}$. We’re going to visualize the 
complex number $z = x + iy$ as a point in the complex plane as shown 
in Figure 16.2. Note that the point $(x, y)$ stands for $z = x + iy$. We introduce 
a related complex number known as $z^*$, pronounced z-star, called the 
complex conjugate of $z$ and given by 


$$z^* = x - iy. \tag{16.48}$$


It’s obtained from z by changing the sign of i, or reflecting z on the real 
axis. 
How does one add, multiply, and divide complex numbers? First, if 
$z_1 = x_1 + iy_1$ and $z_2 = x_2 + iy_2$, we will define their sum as follows: 


$$z_1 + z_2 = (x_1 + x_2) + i(y_1 + y_2), \tag{16.49}$$


which is really just like adding vectors. Subtracting is similar. 
If you want to get the real or imaginary part of $z$, you do the 
following: 


$$\text{Re}[z] = x = \frac{z + z^*}{2}, \qquad \text{Im}[z] = y = \frac{z - z^*}{2i}. \tag{16.50}$$
i 


Of course, given z = x + iy, you can spare all this and read off the real 
and imaginary parts by inspection. However, you will soon need the real 
and imaginary parts of more complicated expressions. In general, if you 
are given some expression f, its complex conjugate f* is obtained from f by 
changing every i to —i and leaving all real numbers alone. For example, if 


$$f = (\cos A + i\sin B)^2 = \cos^2 A - \sin^2 B + 2i\cos A\sin B \tag{16.51}$$
then 
$$f^* = (\cos A - i\sin B)^2 = \cos^2 A - \sin^2 B - 2i\cos A\sin B, \tag{16.52}$$


assuming $A$ and $B$ are real. The real and imaginary parts of $f$ may be read 
off by inspection or generated by the more formal prescription 


$$\text{Re}[f] = \frac{f + f^*}{2} \tag{16.53}$$
$$\text{Im}[f] = \frac{f - f^*}{2i}. \tag{16.54}$$

What is $z_1$ times $z_2$? Just open the brackets and remember $i^2 = -1$. 
Thus 

$$z_1 z_2 = (x_1 + iy_1)(x_2 + iy_2) \tag{16.55}$$
$$= x_1 x_2 - y_1 y_2 + i(x_1 y_2 + y_1 x_2). \tag{16.56}$$
$$\text{Re}[z_1 z_2] = x_1 x_2 - y_1 y_2 \tag{16.57}$$
$$\text{Im}[z_1 z_2] = x_1 y_2 + y_1 x_2. \tag{16.58}$$


Something very nice happens when you multiply $z$ by $z^*$: 

$$zz^* = (x + iy)(x - iy) = x^2 + y^2 = |z|^2. \tag{16.59}$$


We refer to 


$$|z| = \sqrt{x^2 + y^2} \tag{16.60}$$


as the modulus of the complex number z, which is also the length r of z in 
Figure 16.2 by Pythagoras’s theorem. 
I'm going to use $z^*$ to help divide $z_1$ by $z_2$. What is 


$$\frac{z_1}{z_2} = \frac{x_1 + iy_1}{x_2 + iy_2}? \tag{16.61}$$

If I had only $x_2$ in the bottom, I could divide $x_1$ and $iy_1$ by it, because it is an 
ordinary real number, but now I have to divide by $x_2 + iy_2$. The trick is to 
multiply the top and bottom by the complex conjugate of the denominator 
and proceed as follows: 


$$\frac{z_1}{z_2} = \frac{x_1 + iy_1}{x_2 + iy_2} = \frac{(x_1 + iy_1)(x_2 - iy_2)}{(x_2 + iy_2)(x_2 - iy_2)} \tag{16.62}$$
$$= \frac{(x_1 + iy_1)(x_2 - iy_2)}{|z_2|^2}. \tag{16.63}$$


The denominator is an ordinary real number, $|z_2|^2$, while in the numerator 
you can open out the brackets. Remember that if you have a complex 
denominator, and you don’t like it, multiply top and bottom by the complex 
conjugate of the denominator, and the new denominator will become 
a purely real (positive) number. This whole thing, $|z_2|^2$, could be 36, for 
example. So, dividing by a complex number is not a problem. 
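
The conjugate trick translates line by line into code. The sketch below (Python; the function name `divide` and the sample numbers are illustrative assumptions) carries out the division using only real arithmetic and compares it with Python's built-in complex division.

```python
def divide(z1, z2):
    """Compute z1 / z2 by multiplying top and bottom by the conjugate of z2."""
    x1, y1 = z1.real, z1.imag
    x2, y2 = z2.real, z2.imag
    mod2 = x2 * x2 + y2 * y2   # |z2|^2, an ordinary positive real number
    # Numerator (x1 + i y1)(x2 - i y2), opened out using i^2 = -1:
    re = x1 * x2 + y1 * y2
    im = y1 * x2 - x1 * y2
    return complex(re / mod2, im / mod2)

z1, z2 = 3 + 4j, 1 - 2j
print(divide(z1, z2))  # by hand: (3 + 4i)(1 + 2i)/5 = (-5 + 10i)/5 = -1 + 2i
print(z1 / z2)         # built-in complex division, for comparison
```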


16.6 Polar form of complex numbers 


Now you're going to use Euler’s result 


$$e^{ix} = \cos x + i\sin x. \tag{16.64}$$


Let’s take the complex number $z = x + iy$ in Figure 16.2. Let us introduce, 
just as we would for an ordinary vector, the angle $\theta$ and the length $r$, 
which is just $|z|$. Whenever you have a vector, you can talk about the Cartesian 
components $x$ and $y$, or you can talk about the polar components, the 
length $r$ and the angle $\theta$ it makes with the x-axis. Their interrelationship is 
as follows: 


$$x = r\cos\theta \qquad y = r\sin\theta \tag{16.65}$$
$$r = \sqrt{x^2 + y^2} \qquad \theta = \tan^{-1}\frac{y}{x}. \tag{16.66}$$

Let us do the same starting with the Cartesian form of $z$ and write 

$$z = x + iy = r\cos\theta + ir\sin\theta = r(\cos\theta + i\sin\theta) = re^{i\theta}, \tag{16.67}$$


which is called the polar form. One refers to $r$ as the amplitude of $z$ and $\theta$ 
as the phase of $z$. The complex conjugate is 


$$z^* = re^{-i\theta} \quad \text{and} \tag{16.68}$$


$$|z|^2 = z^* z = re^{-i\theta} \cdot re^{i\theta} = r^2. \tag{16.69}$$


Note that the modulus $|z|$ is the same as the amplitude $r$: 


$$|z| = r = \sqrt{x^2 + y^2}. \tag{16.70}$$


The Cartesian and polar forms both describe the same number. One 
displays transparently the real part and imaginary parts; the other, its am- 
plitude and its angle relative to the real axis. It is easier to add complex 
numbers in the Cartesian form (just add the real and imaginary parts), 
while the polar form is best for multiplying and dividing, as I will show 
now. 

Consider the product of two complex numbers, $z_1 = r_1 e^{i\theta_1}$ and $z_2 = r_2 e^{i\theta_2}$. It is 


$$z_1 z_2 = r_1 e^{i\theta_1}\, r_2 e^{i\theta_2} = r_1 r_2\, e^{i(\theta_1 + \theta_2)}. \tag{16.71}$$


If you want to multiply two complex numbers, multiply the lengths to get the 
length of the product and add the phases to get the phase of the product, as 
shown in Figure 16.3. It’s a lot easier to multiply them in this form than in 
Cartesian form, as we did in Eqn. 16.56. 

If the polar form is well suited for multiplying, it’s even better suited 
for dividing: 


$$\frac{z_1}{z_2} = \frac{r_1 e^{i\theta_1}}{r_2 e^{i\theta_2}} \tag{16.72}$$
$$= \frac{r_1}{r_2}\, e^{i(\theta_1 - \theta_2)}. \tag{16.73}$$

To divide by a complex number, divide by its modulus and subtract its phase 
from the phase of the numerator. 

Figure 16.3 The rule for multiplying $z_1$ and $z_2$: add the phase angles $\theta_1$ and $\theta_2$ 
to get the phase of the product and multiply the amplitudes $r_1$ and $r_2$ to get the 
amplitude of the product. 

Here is what I want you to carry in your head, because it’s very, very 
important. Every complex number has a length and a direction. When you 
multiply by a second complex number, you're able to do two things at the 
same time. You’re able to rescale it, and you’re able to rotate it. You rescale 
by the length of the second factor, and you rotate by the phase angle carried 
by the second factor. The fact that two operations are done in one shot is 
one of the reasons complex numbers play an incredibly important role in 
physics, as they do in engineering and mathematical physics. I also find 
imaginary numbers useful when computing my tax deductions. 
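
The rescale-and-rotate rule is easy to try out with Python's standard `cmath` module, whose `rect` and `polar` functions convert between the polar and Cartesian forms. The particular lengths and angles below are arbitrary illustrative values.

```python
import cmath
import math

# Build two numbers from their polar data (length, phase).
z1 = cmath.rect(2.0, math.radians(30))   # r1 = 2,   theta1 = 30 degrees
z2 = cmath.rect(1.5, math.radians(45))   # r2 = 1.5, theta2 = 45 degrees

# Multiply in Cartesian form, then read off the polar data of the result.
r_prod, theta_prod = cmath.polar(z1 * z2)
print(r_prod, math.degrees(theta_prod))  # lengths multiply, phases add

# Divide: lengths divide, phases subtract.
r_quot, theta_quot = cmath.polar(z1 / z2)
print(r_quot, math.degrees(theta_quot))
```

One caveat for larger angles: `cmath.polar` reports the phase in the range $(-\pi, \pi]$, so the sum of two phases may come back shifted by a multiple of $2\pi$.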


CHAPTER 17 


Simple Harmonic Motion 


We’re now going to study what are called small oscillations, or simple 
harmonic motion. Take any mechanical system that is in a state of equi- 
librium. Equilibrium means the forces on the body add up to zero. It has 
no desire to move. If you give it a little kick, a push away from the equilib- 
rium point, what will happen? There are two main possibilities. Imagine 
a marble on top of a hill. That is in unstable equilibrium because if you 
give the marble a nudge, it will roll downhill and never return to you. The 
other possibility involves stable equilibrium: if you push the system away 
from equilibrium, there are forces bringing it back. The standard example 
is a marble in a bowl: when it is shaken from its position at the bottom, it 
will rock back and forth until it settles again. A rod hanging vertically from 
the ceiling from a pivot, when pulled to the side and released, will swing 
back and forth. These are examples of simple harmonic motion, which 
results whenever any system is slightly disturbed from stable equilibrium. 

Figure 17.1 The mass $m$ rests on a table and is connected to a spring of force 
constant $k$, which is anchored to the wall. The displacement from equilibrium is 
denoted by $x$. It is positive in the figure but it could also be negative if the mass 
were to be displaced the other way. 

The example that we’re going to consider is a mass $m$, resting on 
a table, connected to a spring, which in turn is connected to the wall. 
The spring is not stretched or contracted; the mass is at rest, as shown in 
Figure 17.1. That’s what I mean by equilibrium. Now let it be displaced by 
an amount $x$ from this point of equilibrium. The spring force is $F = -kx$ 
and Newton’s law says 


$$m\frac{d^2x}{dt^2} = -kx. \tag{17.1}$$


If the mass strays to the right, $x$ is positive and $-kx$ is to the left, so as to 
send it back toward its equilibrium position. If $x$ is negative, the restoring 
force is positive, again pointing to the equilibrium position. 

We want to understand the behavior of such a mass. How do we 
solve this problem? Our job is to find the function $x(t)$ that satisfies this 
equation, which we rewrite as follows: 


$$\frac{d^2x}{dt^2} = -\omega^2 x \tag{17.2}$$
$$\omega = \sqrt{\frac{k}{m}}. \tag{17.3}$$


You can make it a word problem and say, “I’m looking for a function 
whose second derivative is minus itself, except for this number $\omega^2$.” 
Trigonometric functions have the property that if you take two derivatives, 
they return to minus themselves. So, you can guess that $x = \cos t$ 
but it won’t work, as I showed you before. On the other hand, the 
guess 


$$x(t) = A\cos\omega t \tag{17.4}$$
will obey this equation. While $A$ is clearly the amplitude, $\omega$ is related to 
the frequency of oscillations as follows. If I start at $t = 0$ when $x = A$, how 
long do I have to wait until it comes back to $A$? I have to wait a time $T$, 
such that 


$$\omega T = 2\pi \tag{17.5}$$


because that’s when the cosine returns to 1. That means the time that I 
have to wait is 


$$T = \frac{2\pi}{\omega}. \tag{17.6}$$


You can rewrite this as 


$$\omega = \frac{2\pi}{T} = 2\pi f \tag{17.7}$$


where $f = \frac{1}{T}$ is what we would normally call frequency, which is how many 
oscillations it completes per second. It is the inverse of the time period. In 
physics talk, frequency usually means $\omega$. 

So, if you pull a mass and let it go, it oscillates with a frequency that 
is connected to the force constant and the mass. If the spring is very stiff 
and k is very large, the frequency is very high. If the mass is very big and 
the motion is very sluggish, f is diminished. So, all that stuff you expect 
intuitively is quantified by the solution to the equation, but there is more. 
For example, it is not intuitively obvious that if you make the mass four 
times as big, you will double the time period. 
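
The dependence of the period on $k$ and $m$ can be tabulated in a few lines. This is a Python sketch with made-up values of the spring constant and mass; the function names are illustrative.

```python
import math

def angular_frequency(k, m):
    """omega = sqrt(k / m) for a mass m on a spring of force constant k."""
    return math.sqrt(k / m)

def period(k, m):
    """T = 2 pi / omega."""
    return 2 * math.pi / angular_frequency(k, m)

k, m = 100.0, 0.25  # illustrative values: N/m and kg
print("T   =", period(k, m))
print("T   with 4m:", period(k, 4 * m))   # twice as long
print("T   with 4k:", period(4 * k, m))   # half as long
print("f   =", 1 / period(k, m), " omega =", angular_frequency(k, m))
```

Since $T \propto \sqrt{m/k}$, quadrupling the mass doubles the period, and quadrupling the stiffness halves it.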

One remarkable part of the solution is that you can pick any $A$ you 
like without changing $\omega$ or $T$. Think about what that means. The amplitude 
$A$ is the amount by which you pulled the mass when you let it go. You 
find that whether you pull the spring by one inch or by ten inches, it takes 
the same time to finish a full back-and-forth motion. If you pull it by two 
inches, compared to one inch, it has a longer way to go. But if you pull 
it by two inches, the spring is going to be that much more tense, and it’s 
going to exert a bigger force so that it will go faster for most of the time; 
that’s very clear. But the fact that it goes faster in exactly the right way to 
complete the trip in exactly the same time is rather a miraculous property 
of Eqn. 17.2. If you tamper with it, if you add to the force even a tiny extra 
term, say proportional to $x^3$, then this feature is gone. It’s like saying that 
planets move around the sun in closed elliptical orbits only under the $1/r^2$ 
force. It is not true if the force falls as any other power of $r$. 
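
The amplitude independence of the period, and its loss when the force law is tampered with, can be seen by integrating Newton's law numerically. The Python sketch below is an illustration under stated assumptions: the extra term is taken to be cubic, $F = -kx - \epsilon x^3$, the integrator is a simple velocity Verlet scheme, and the values $k = m = 1$, $\epsilon = 0.5$ are arbitrary. The period is estimated as four times the time the mass takes to reach $x = 0$ after being released from rest at $x = A > 0$.

```python
import math

def period(amplitude, k=1.0, m=1.0, eps=0.0, dt=1e-4):
    """Estimate the period for F = -k x - eps x^3, released from rest at x = amplitude > 0."""
    x, v, t = amplitude, 0.0, 0.0
    a = (-k * x - eps * x ** 3) / m
    while True:
        # One velocity Verlet step.
        x_new = x + v * dt + 0.5 * a * dt * dt
        a_new = (-k * x_new - eps * x_new ** 3) / m
        v_new = v + 0.5 * (a + a_new) * dt
        if x_new <= 0.0:
            # Linear interpolation for the instant x crosses zero (a quarter period).
            t_cross = t + dt * x / (x - x_new)
            return 4.0 * t_cross
        x, v, a, t = x_new, v_new, a_new, t + dt

print("pure spring:  ", period(1.0), period(2.0), " (2 pi =", 2 * math.pi, ")")
print("with x^3 term:", period(1.0, eps=0.5), period(2.0, eps=0.5))
```

For the pure spring both amplitudes give the same period; with the cubic term switched on, the larger amplitude completes its trip noticeably faster.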

Now consider the following variant of this solution. You set your 
clock to 0 at the origin in the graph. Suppose I set my clock to 0, right 
there on the dotted vertical line in Figure 17.2 at $t = \frac{\pi}{2}$. When my clock 
says 0, $x$ is not at the maximum; it vanishes. But it’s the same physics, and 
it’s the same equation. Where, then, is the solution that describes what I 
see? It is there and it comes from the fact that we had the latitude of adding 
a certain angle $\phi$, called a phase, to the solution: 


$$x(t) = A\cos[\omega t + \phi]. \tag{17.8}$$


Your choice is $\phi = 0$ and mine is $\phi = \frac{\pi}{2}$. You can verify that whatever we 
pick for $\phi$, the above $x(t)$ will be a solution because two derivatives of the 
solution with the $\phi$ is also $-\omega^2$ times itself. And, whatever you pick for 
$A$, it will still work, because $A$ cancels out of both sides in Eqn. 17.2. So, 
whenever you have an oscillator, say, a mass and spring system, and you 
want to know what $x$ is going to be at all times, it is not enough to know 
that it obeys Eqn. 17.2; you need to know the amplitude and the phase. 
These are determined by knowing two things about the solution, which is 
usually the $x$ and $v$ at some time, usually $t = 0$. For this reason we refer to 
them as initial value data. 
Let me give you an example. Suppose an oscillator has x(0) = 5 and 
velocity v(0) = 0, at t = 0. What does that mean? I pulled the mass by 5 
and I let it go. I give you the values of the spring constant k and the mass 


m, and I say, “What’s the future x?” 
Figure 17.2 The function $A\cos\omega t$ for the case $A = 4$, $\omega = 1$. The dotted vertical 
line is another possible way to set the clock to zero, another choice of phase, 
namely $\phi = \frac{\pi}{2}$. 
First observe that the velocity corresponding to our solution Eqn. 
17.8 is 


$$v(t) = -\omega A\sin(\omega t + \phi). \tag{17.9}$$

So we know two things at $t = 0$: 


$$5 = A\cos[0 + \phi] \tag{17.10}$$
$$0 = -A\omega\sin[0 + \phi]. \tag{17.11}$$


The second equation gives us two choices: either $A = 0$, which is a trivial 
solution, or $\phi = 0$, which lets $A$ survive this test. The first equation with 
$\phi = 0$ gives $A = 5$ leading to the solution $x(t) = 5\cos\omega t$. This is a problem 
where we did not need a non-zero $\phi$. But it could have been that when you 
joined the experiment, you were somewhere to the right of the origin, on 
the vertical dotted line in Figure 17.2, when you set your clocks to zero. 
Then you would have, as your initial conditions, $x = 0$, $v = -5\omega$, which 
means $\phi = \frac{\pi}{2}$. Of course $A = 5$ as before. 
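
The same bookkeeping works for any initial data. Solving $x(0) = A\cos\phi$ and $v(0) = -\omega A\sin\phi$ for the two unknowns gives $A = \sqrt{x(0)^2 + (v(0)/\omega)^2}$ and $\tan\phi = -v(0)/(\omega x(0))$. The Python sketch below applies this; the function name and the value $\omega = 2$ are illustrative assumptions.

```python
import math

def amplitude_and_phase(x0, v0, omega):
    """Return (A, phi) so that x(t) = A cos(omega t + phi) has x(0) = x0, v(0) = v0."""
    # From x0 = A cos(phi) and v0 = -omega A sin(phi):
    A = math.hypot(x0, v0 / omega)
    phi = math.atan2(-v0 / omega, x0)  # atan2 picks the correct quadrant
    return A, phi

omega = 2.0  # illustrative value; in the text omega = sqrt(k / m)
print(amplitude_and_phase(5.0, 0.0, omega))           # released from rest at x = 5
print(amplitude_and_phase(0.0, -5.0 * omega, omega))  # at x = 0, moving to the left
```

The first call reproduces $A = 5$, $\phi = 0$; the second reproduces $A = 5$, $\phi = \pi/2$.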

Let us agree that, if there’s only one oscillator, it is perverse to set your 
clock to 0 at any time other than when the oscillator is at its maximum 
displacement, so that 


$$x(t) = A\cos\omega t. \tag{17.12}$$


(If there are two oscillators oscillating out of step, it’s impossible to make 
$\phi = 0$ for both of them: you can set your clock to 0 when one of them is 
at a maximum, but then the other may not be at its maximum.) Going 
forward, remember that the velocity and acceleration are, for all future 
times, 


$$v(t) = -\omega A\sin\omega t \tag{17.13}$$
$$a(t) = -\omega^2 A\cos\omega t \quad \left(= -\frac{k}{m}\,x(t) \text{ in accordance with } F = ma\right). \tag{17.14}$$


So the velocity also oscillates sinusoidally but with an amplitude $\omega A$. 
The acceleration also oscillates but with an amplitude $\omega^2 A$. These two 
results are true for any phase $\phi$. 
Let us explicitly verify the law of conservation of energy. Consider the 
total energy: 


$$E(t) = \frac{1}{2}mv^2 + \frac{1}{2}kx^2 \tag{17.15}$$
$$= \frac{1}{2}m\omega^2 A^2\sin^2\omega t + \frac{1}{2}kA^2\cos^2\omega t \tag{17.16}$$
$$= \frac{1}{2}kA^2 \quad \text{because } \omega^2 = \frac{k}{m}. \tag{17.17}$$


Thus, by magic, the time-dependent terms $\sin^2\omega t$ and $\cos^2\omega t$ have 
the same coefficient, and you find that $E(t)$ actually does not depend on 
time at all. Even though position and velocity are constantly changing, 
this combination will not depend on time. At the instant when the mass 
has reached one extremity and is about to swing back, it has no velocity; 
it only has an $x = A$, and the energy of the oscillator is all potential 
energy, $\frac{1}{2}kA^2$. 
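
A numerical spot check of this cancellation: evaluate the kinetic and potential energies at several times and confirm that their sum stays at $\frac{1}{2}kA^2$. The Python sketch below uses made-up values of $A$, $k$, and $m$.

```python
import math

def energy(t, A, k, m):
    """Total energy (1/2) m v^2 + (1/2) k x^2 for x(t) = A cos(omega t)."""
    omega = math.sqrt(k / m)
    x = A * math.cos(omega * t)
    v = -omega * A * math.sin(omega * t)
    return 0.5 * m * v * v + 0.5 * k * x * x

A, k, m = 0.3, 50.0, 2.0  # illustrative values
for t in (0.0, 0.1, 0.5, 1.7, 10.0):
    print(t, energy(t, A, k, m))
print("(1/2) k A^2 =", 0.5 * k * A * A)
```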


17.1 More examples of oscillations 


If a body is in stable equilibrium, and you disturb it, it rocks back and 
forth, executing simple harmonic motion. The standard textbook example 
is the mass on a spring, which we just studied. But it is a very generic 
situation, as shown in Figure 17.3. Skipping the mass-and-spring example, 
let us go to the top right, where we have a beam hanging from the ceiling by 
a cable that is fixed to its center of mass (CM). If you twist it by an angle $\theta$, 
it will try to untwist itself. Now we don’t have a restoring force but we have 
a restoring torque. What can be the expression for the restoring torque $\tau$? 
When you don’t do anything, the cable doesn’t do anything, so $\tau$ vanishes 
when $\theta = 0$. If $\theta \neq 0$, it is some function of $\theta$, and the leading term in the 
Taylor expansion would be proportional to $\theta$: 


$$\tau(\theta) = -\kappa\theta. \tag{17.18}$$


The coefficient $\kappa$ is the torsion constant, and the minus sign tells you it’s 
a restoring torque. That means if you make $\theta$ positive, the torque will try 
to twist you the other way. The torsion constant, which is the restoring 
torque per unit angular displacement, is to rotations what the spring constant 
was to linear oscillations: the restoring force per unit displacement. 


Figure 17.3 Some examples of simple harmonic motion. The left shows the 
eternal favorite, the mass and spring; top right is a beam hanging from the ceiling 
by a cable; the bottom left and right show a physical pendulum supported at the 
pivot P, when it is in equilibrium, and when it is displaced by an angle $\theta$. The 
vector $\mathbf{g}$ represents the downward gravitational field of magnitude $9.8\ \text{m/s}^2$. 


You have to find this $\kappa$, which is not given, the way $k$ is. Once you do, you 
can say 


$$I\frac{d^2\theta}{dt^2} = -\kappa\theta \tag{17.19}$$


where $I$ is the moment of inertia of the beam about the point of suspension. 
Mathematically, this equation is identical in form to 


$$m\frac{d^2x}{dt^2} = -kx \tag{17.20}$$


with the substitution $x \to \theta$, $m \to I$, $k \to \kappa$. So the answer follows: 


$$\theta(t) = A\cos\omega t \tag{17.21}$$
$$\omega = \sqrt{\frac{\kappa}{I}}. \tag{17.22}$$
The mass-spring system executes linear oscillations, while the beam 
executes angular oscillations. Another example of the latter is the simple 
pendulum. The pendulum has a bob of mass $m$ hanging by a massless rod 
of length $l$. If you let it hang vertically, it will stay that way forever. No 
torque, no motion. Suppose you pull it by an angle $\theta$ and release it. To 
predict the future, you need to find $I$ and $\kappa$. Now $I$ is easy: for a single 
mass $m$ at a distance $l$ from the pivot point, $I = ml^2$. To find $\kappa$, you need 
to find the restoring torque per angular displacement. If you displace by $\theta$, 
the torque about the pivot point is 


$$\tau = -mgl\sin\theta \approx -mgl\,\theta \tag{17.23}$$


where I have approximated $\sin\theta$ by $\theta$, which is the leading term in the 
Taylor expansion. With just this term, we can read off $\kappa$: 


$$\kappa = -\frac{\tau}{\theta} = mgl. \tag{17.24}$$


So 


$$\omega = \sqrt{\frac{\kappa}{I}} = \sqrt{\frac{mgl}{ml^2}} = \sqrt{\frac{g}{l}} \tag{17.25}$$


from which follows the familiar formula 


$$T = \frac{2\pi}{\omega} = 2\pi\sqrt{\frac{l}{g}}. \tag{17.26}$$


Notice that if you displace the pendulum by large angles, when $\sin\theta$ cannot 
be approximated by $\theta$, the frequency will no longer be independent of the 
amplitude. 
Note that finding $\omega$ took some work. You had to disturb the system 
from equilibrium and find the restoring torque per unit angle $\kappa = -\frac{\tau}{\theta}$ and 
also compute $I$, whereas in the case of the mass-spring system, you were 
simply given $m$ and $k$. In the case of the twisted cable, $\kappa$ will be given to 
you, because computing it from first principles requires work beyond the 
scope of this course. 
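
The recipe "find $\kappa$, find $I$, take $\sqrt{\kappa/I}$" is short enough to write out in code. This Python sketch evaluates the small-angle period of a simple pendulum both ways; the length, mass, and the value $g = 9.8\ \text{m/s}^2$ are illustrative inputs.

```python
import math

G = 9.8  # m/s^2, magnitude of the gravitational field

def pendulum_period(l, g=G):
    """Small-angle period T = 2 pi sqrt(l / g) of a simple pendulum of length l."""
    return 2 * math.pi * math.sqrt(l / g)

def period_from_kappa_and_I(m, l, g=G):
    """Same period via the general recipe omega = sqrt(kappa / I)."""
    kappa = m * g * l      # restoring torque per unit angle (small angles)
    I = m * l ** 2         # moment of inertia of the bob about the pivot
    return 2 * math.pi / math.sqrt(kappa / I)

print(pendulum_period(1.0))               # a 1 m pendulum: about 2 s
print(period_from_kappa_and_I(0.2, 1.0))  # the mass drops out
print(period_from_kappa_and_I(5.0, 1.0))
```

The mass cancels between $\kappa$ and $I$, which is why only $l$ and $g$ survive in Eqn. 17.26.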

Let us move from a pendulum with all the mass concentrated in the 
bob to a physical pendulum, some irregularly shaped flat planar object, as 
shown in the middle of the second line of Figure 17.3. You drive a nail 
through it at some pivot point P and hang it on the wall. It will come to 
rest in a certain equilibrium configuration. Think about where the center 
of mass will be. It will lie somewhere on the vertical line going through 
P—otherwise the force of gravity, which is effectively acting at the CM, 
will produce a torque around P. 

Let us look at the forces. This body, when hanging in its rest position, 
has two forces on it that cancel each other: the force of the nail, which is pushing up, 
and the weight of the body $mg$, which is pulling down. The nail will keep it 
from falling. The nail will not keep it from swinging, because the force of 
the nail, acting as it does at the pivot point, is unable to exert a torque, 
whereas the minute you rotate the body, mg is able to exert a torque, as 
is clear from the figure. That’s why if you rotate it and let go, it will start 
swinging back and forth. 

What will the torque be? It will be the same as before: $-mgl\sin\theta$, 
where $l$ is now the distance between the pivot point and the center of mass. 
As far as the torque is concerned, it’s as if all the mass were sitting at the 
CM. But the moment of inertia is not as if all the mass is sitting at the CM, 
in which case it would be $ml^2$. So don’t make that mistake. All the mass 
is not sitting at the CM; it is all over the place. The moment of inertia is 
$I = I_{CM} + ml^2$ by the parallel axis theorem, where $I_{CM}$ is hard to compute 
for an irregular object. 
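
For a regular object $I_{CM}$ is known, and the difference between the right and wrong moments of inertia can be made concrete. The Python sketch below takes, purely as an illustration not drawn from the lecture, a uniform rod of length $L$ pivoted at one end, for which $I_{CM} = mL^2/12$ and $l = L/2$; the function name and numerical values are assumptions.

```python
import math

def physical_pendulum_period(m, l, I_cm, g=9.8):
    """Small-angle period of a physical pendulum pivoted a distance l from its CM."""
    I = I_cm + m * l ** 2   # parallel axis theorem
    kappa = m * g * l       # restoring torque per unit angle (small angles)
    return 2 * math.pi * math.sqrt(I / kappa)

# Illustration: uniform rod of length L hung from one end.
m, L = 1.0, 1.0
T_rod = physical_pendulum_period(m, l=L / 2, I_cm=m * L ** 2 / 12)

# The mistake warned against: pretending all the mass sits at the CM (I = m l^2).
T_wrong = physical_pendulum_period(m, l=L / 2, I_cm=0.0)

print(T_rod, 2 * math.pi * math.sqrt(2 * L / (3 * 9.8)))  # these two agree
print(T_wrong)                                            # too short
```

Ignoring $I_{CM}$ underestimates the moment of inertia and therefore predicts a period that is too short.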

So, every problem that you will ever get will look like one of these two. 
Either something is moving linearly with a coordinate that you can call x, 
or something is rotating or twisting by an angle you can call $\theta$. And if you 
want to find out the frequency of vibration, you have to disturb it from 
equilibrium—by pulling the mass, twisting the cable, or displacing the 
pendulum from its equilibrium position—in order to find the restoring 
force or torque per unit displacement. 


17.2 Superposition of solutions 


I will now go over more complicated oscillations using some of the 
formulas we learned in the last chapter. Here is the most important one: 


$$e^{i\theta} = \cos\theta + i\sin\theta. \tag{17.27}$$


This is a formula worth memorizing. You should realize that given any 
expression involving complex numbers, you can get another equation by 
taking the complex conjugate of both sides, where every $i$ is changed to 
minus $i$. That will give you 


$$e^{-i\theta} = \cos\theta - i\sin\theta. \tag{17.28}$$


This is true because if two complex numbers $z_1 = x_1 + iy_1$ and $z_2 = x_2 + iy_2$ 
are equal, then the real and imaginary parts are separately equal, and so are 
their complex conjugates: $z_1^* = x_1 - iy_1 = x_2 - iy_2 = z_2^*$. The two previous 
equations can be inverted to give 


$$\cos\theta = \frac{e^{i\theta} + e^{-i\theta}}{2} \tag{17.29}$$
$$\sin\theta = \frac{e^{i\theta} - e^{-i\theta}}{2i}. \tag{17.30}$$


You don’t need trigonometric functions once you have the exponen- 
tial function, provided you let the exponent be complex or imaginary. This 
is one example of unification. People always say Maxwell unified this, and 
Einstein tried to unify that. Unification means things that you thought 
were unrelated are, in fact, related, and they are different manifestations 
of the same thing. When we first discovered trigonometric functions, we 
were thinking right-angle triangles, opposite sides and adjacent sides, and 
so on. Then, we discovered the exponential function, which, by the way, 
was used by bankers who were trying to calculate compound interest con- 
tinuously at every instant. The fact that those functions are related is a 
marvelous result, but it emerges only if you invoke complex numbers. 

Finally, remember that there are two ways to write a complex number: 

$$z = x + iy = re^{i\theta} = |z|e^{i\theta}. \tag{17.31}$$

Now we use the new tools to attack the familiar equation 


$$\ddot{x} = \frac{d^2x}{dt^2} = -\omega_0^2 x, \qquad \omega_0 = \sqrt{\frac{k}{m}}, \tag{17.32}$$

where the second derivative of $x$ is written as $\ddot{x}$, and $\omega_0$, the natural 
frequency of vibrations of the oscillator, has been given a subscript to 
distinguish it from other $\omega$’s that will arise shortly. Earlier we solved this 
equation by turning it into a word problem: “What is the function, x(t), 
with the property that two derivatives of the function look like the func- 
tion itself, except for a proportionality constant?” We racked our brains 
and we remembered that sines and cosines had this property. One deriva- 
tive is no good; it turns sine into cosine and vice versa. Two derivatives 
bring back the function you started with, which is why the answer could 
be sines or cosines. But now I’m going to solve the equation a different 
way. I know a function that is even better—it reproduces itself when it is 
differentiated once. If so, it’s obvious that its 92-nd derivative will also look 
like the function. But recall why we rejected 


$$ x(t) = Ae^{t}. \tag{17.33} $$


I want to get something proportional to —x(t) upon taking two 
derivatives, and this does not do it: I get +x(t). It does not help to try 
something like $Ae^{-t}$ because after two derivatives I again get +x(t). So 
this function is no good. Also, it doesn’t look like what I want. Even with- 
out doing much work, I know that if I pull this spring it’s going to go back 
and forth, whereas these functions are exponentially growing or they’re ex- 
ponentially falling; they just don’t do the trick. But now we have a way out: 
let the exponent be complex. 

We are going to make a guess, called an ansatz in the business: 


$$ x(t) = Ae^{\alpha t} \tag{17.34} $$


where we will now allow $\alpha$ to be some general complex number. 

The ansatz is a tentative guess with some parameters, A and $\alpha$ in 
this instance, the judicious choice of which may yield a solution. If you’re 
lucky, it will work. If not, you move on and try another solution; it is just 
like speed dating. 

So, let’s take the ansatz in Eqn. 17.34, put it in Eqn. 17.32, and 
demand that it be satisfied: 


$$ \ddot{x} + \omega_0^2 x = 0 \tag{17.35} $$
$$ \alpha^2 Ae^{\alpha t} + \omega_0^2 Ae^{\alpha t} = 0 \tag{17.36} $$
$$ A(\alpha^2 + \omega_0^2)e^{\alpha t} = 0. \tag{17.37} $$

Our ansatz will work if we manage to get $(Ae^{\alpha t})(\alpha^2 + \omega_0^2)$ to vanish. 
How many ways are there to kill this beast? The choice A = 0 is called the 
trivial solution and corresponds to the oscillator sitting still forever. So A 
can be anything, except 0. Now $e^{\alpha t}$ never vanishes (even if $\alpha$ is complex, 
as here), so it is not the cause of the zero. So it must be that 


$$ \alpha^2 + \omega_0^2 = 0 \quad \text{which means} \quad \alpha = \pm i\omega_0. \tag{17.38} $$


(More generally, if in place of $e^{\alpha t}$, which never vanishes, we had any function 
that did not vanish identically, we could still cancel it by picking a time 
when it is non-zero, and deduce Eqn. 17.38.) 

So now I have two solutions of the form $Ae^{\alpha t}$. For A you can pick any 
number you like, in fact, real or complex; it doesn’t matter. The equation 
is satisfied. But $\alpha$ can be only one of two numbers: $\pm i\omega_0$. 

How do we choose between the two solutions 


$$ x_+(t) = Ae^{i\omega_0 t} \quad \text{and} \tag{17.39} $$
$$ x_-(t) = Ae^{-i\omega_0 t}\,? \tag{17.40} $$


It turns out that we can pick both, and I’ll tell you what I mean 
by that. Let us begin with the fact that Eqn. 17.32 is a homogeneous, linear 
differential equation. I’ll have to tell you what that means through an 
example: 


$$ 17\frac{d^{96}x}{dt^{96}} + 16\frac{d^{3}x}{dt^{3}} + 2x = 0. \tag{17.41} $$


It is homogeneous because you only find a single power of x anywhere, 
which happens to be the first power here. It is a linear equation because you 
find either the function x or its derivatives, but never the squares or cubes 
or higher powers of x or the derivatives. Note that the 96-th derivative does 
not change this fact; it is still the 96-th derivative of x and not, say, $x^{96}$. By 
contrast, 

$$ \frac{dx}{dt} + 3x^2 = 0 \tag{17.42} $$


is a non-linear equation because of the $x^2$ term. A linear equation has a 
very important property that lies at the heart of so many things we do. 
This is called the principle of superposition, and it states: if $x_1(t)$ and $x_2(t)$ 
are two solutions of a homogeneous linear equation, then so is any linear 
combination of them with constant (t-independent) coefficients A and B: 


$$ x(t) = Ax_1(t) + Bx_2(t). $$


Let us prove this for the oscillator case to understand where linearity 
comes in. Given 


$$ \ddot{x}_1 + \omega_0^2 x_1 = 0 \tag{17.43} $$
$$ \ddot{x}_2 + \omega_0^2 x_2 = 0, \tag{17.44} $$


let us multiply the first by a constant A, the second by B, and add 
to get 


$$ A\ddot{x}_1 + A\omega_0^2 x_1 + B\ddot{x}_2 + B\omega_0^2 x_2 = 0 \tag{17.45} $$
$$ \frac{d^2(Ax_1 + Bx_2)}{dt^2} + \omega_0^2(Ax_1 + Bx_2) = 0, \tag{17.46} $$


which clearly shows that $x(t) = Ax_1(t) + Bx_2(t)$ is also a solution. We used 
the fact that any derivative of a linear combination is the same linear 
combination of the derivatives and that the non-derivative term was linear in 
x. Try doing this for the non-linear case, say Eqn. 17.42, and you will find 
it does not work because $3Ax_1(t)^2 + 3Bx_2(t)^2 \neq 3(Ax_1(t) + Bx_2(t))^2$. 
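
Superposition can also be checked numerically. The following is a minimal sketch under stated assumptions: the value of `OMEGA0`, the step `H`, and all function names are illustrative choices; the second derivative is estimated by a central finite difference rather than computed exactly; and the two basic solutions are taken to be the cosine and the sine. The last function compares only the quadratic terms, which is where the non-linear case fails.

```python
import math

OMEGA0 = 2.0   # natural frequency; an arbitrary illustrative value
H = 1e-4       # step for the finite-difference second derivative


def second_derivative(f, t, h=H):
    """Central-difference estimate of f''(t)."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / (h * h)


def oscillator_residual(f, t):
    """Left-hand side of Eqn. 17.35, x'' + omega0^2 x; near zero for a solution."""
    return second_derivative(f, t) + OMEGA0 ** 2 * f(t)


def x1(t):
    return math.cos(OMEGA0 * t)


def x2(t):
    return math.sin(OMEGA0 * t)


def combination(a, b):
    """Return the function a*x1 + b*x2."""
    return lambda t: a * x1(t) + b * x2(t)


def quadratic_mismatch(a, b, t):
    """3(a x1 + b x2)^2 minus (3a x1^2 + 3b x2^2); non-zero in general."""
    combined = 3 * (a * x1(t) + b * x2(t)) ** 2
    separate = 3 * a * x1(t) ** 2 + 3 * b * x2(t) ** 2
    return combined - separate


if __name__ == "__main__":
    x = combination(3.0, -1.5)
    for t in (0.0, 0.4, 1.3):
        print(f"t={t}: linear residual {oscillator_residual(x, t):+.2e}, "
              f"quadratic mismatch {quadratic_mismatch(3.0, -1.5, t):+.3f}")
```

The linear residual stays at the level of the finite-difference error for any choice of the two coefficients, while the quadratic mismatch does not vanish.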

The bottom line is that if you give me two independent solutions 
to a homogeneous linear equation, I can manufacture an infinite number 
of solutions because I can pick A and B any way I like. The solutions $x_1$ 
and $x_2$ are like unit vectors i and j, whose linear combinations with all 
possible coefficients yield an infinite number of vectors in two dimensions. 
A word of caution: i and 3i are also two vectors, but by combining them 
you can only get vectors parallel to i. These two vectors are said to be 
linearly dependent, which in this simple case means one is a multiple of the 
other. Likewise $e^{i\omega_0 t}$ and $5e^{i\omega_0 t}$ cannot be used to build anything other than 
multiples of $e^{i\omega_0 t}$. However, $e^{-i\omega_0 t}$ is an independent object because it is not 
a multiple of $e^{i\omega_0 t}$. 

By the same analogy with i and j, if a linear combination of two 
linearly independent functions equals another linear combination, the 
coefficients have to match on both sides. Thus 


$$ Ae^{i\omega_0 t} + Be^{-i\omega_0 t} = Ce^{i\omega_0 t} + De^{-i\omega_0 t} \quad \text{implies} \tag{17.47} $$
$$ A = C, \qquad B = D. \tag{17.48} $$


17.3 Conditions on solutions to the harmonic oscillator 


Let us then consider the general solution 

$$ x(t) = Ae^{i\omega_0 t} + Be^{-i\omega_0 t}. \tag{17.49} $$


How do we decide what A and B are? In general they are arbitrary. 
But on a given day, when you pull the mass by 9 cm and release it from 
rest, A and B have to be chosen so that at t = 0, x(0) = 9 and the veloc- 
ity v(0) = 0. But I have a bigger problem. The answer is manifestly not 
real, and we know x is a real function. That is not a mathematical re- 
quirement of the equation, but a physical requirement. To say that x is real 
means the following. A complex number x + iy has a complex conjugate 
x — iy, and the property of real numbers is that when you take the com- 
plex conjugate, nothing happens: it satisfies the condition z = z*. There 
is no imaginary part whose sign you can flip. Real numbers are their own 
complex conjugates. 

So, I’m going to demand that this solution, in addition to satisfying 
the basic equation, also is real. To do that, I’m going to demand x(t) equals 
its complex conjugate $x^*(t)$: 


$$ x^*(t) = A^*e^{-i\omega_0 t} + B^*e^{i\omega_0 t} = x(t) = Ae^{i\omega_0 t} + Be^{-i\omega_0 t}. \tag{17.50} $$


To find $x^*(t)$ given x(t), I conjugated everything in sight. The complex 
conjugates of A and B became $A^*$ and $B^*$. The complex conjugate of 
$e^{i\omega_0 t}$ is $e^{-i\omega_0 t}$ and vice versa, because the i goes to minus i while t and $\omega_0$ 
are real numbers and nothing happens to them. 

So $x(t) = x^*(t)$ for all times t, if the coefficients of $e^{\pm i\omega_0 t}$ in Eqn. 
17.50 match: 


$$ A = B^*, \qquad B = A^*. \tag{17.51} $$

However, if $A = B^*$, then $B = A^*$ follows automatically because both 
are saying the same thing: A and B have equal real parts and opposite imaginary 
parts. This can also be seen another way. Conjugating both sides of 
$A = B^*$, we get $A^* = (B^*)^* = B$ because conjugating any complex number 
twice changes the sign of its imaginary part twice, which is equivalent to 
doing nothing: (z*)* = z. 

The reality of x then leads to the solution 


$$ x(t) = Ae^{i\omega_0 t} + A^*e^{-i\omega_0 t}. \tag{17.52} $$


In other words, B is not an independent number; it has to be the complex 
conjugate of A if x is to be real. I hope you can see at a glance that the 
solution above is real, because whatever the first animal is, the second is 
its complex conjugate and has the opposite imaginary part. When you add 
them, the answer will be real. But A is not necessarily real. In polar form it 
has a modulus |A| and a phase $\phi$, so that 


$$ x(t) = |A|e^{i\phi}e^{i\omega_0 t} + |A|e^{-i\phi}e^{-i\omega_0 t} = |A|e^{i(\omega_0 t + \phi)} + |A|e^{-i(\omega_0 t + \phi)} $$
$$ = |A|\left[e^{i(\omega_0 t + \phi)} + e^{-i(\omega_0 t + \phi)}\right]. \tag{17.53} $$


Now, what is this function I have in brackets? You should be able to 
recognize this creature as a cosine. We have ended up with 


$$ x(t) = 2|A|\cos(\omega_0 t + \phi). \tag{17.54} $$


This describes an oscillator of amplitude 2|A| and phase $\phi$. Notice 
how the amplitude and phase of the oscillator were encoded in a single 
complex number A. 
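
Here is a small Python check of the step from Eqn. 17.52 to Eqn. 17.54, offered as a sketch rather than as part of the derivation. The function names and the sample values of A and $\omega_0$ are assumptions made for illustration. The last line treats the earlier example of a mass pulled by 9 cm and released from rest, which corresponds to a real A = 4.5 (phase zero, amplitude 2|A| = 9).

```python
import cmath
import math


def x_from_exponentials(A: complex, omega0: float, t: float) -> complex:
    """Eqn. 17.52: A e^{i w0 t} + A* e^{-i w0 t}."""
    return (A * cmath.exp(1j * omega0 * t)
            + A.conjugate() * cmath.exp(-1j * omega0 * t))


def x_from_cosine(A: complex, omega0: float, t: float) -> float:
    """Eqn. 17.54: 2|A| cos(w0 t + phi), with phi the phase of A."""
    return 2 * abs(A) * math.cos(omega0 * t + cmath.phase(A))


if __name__ == "__main__":
    A = complex(1.5, -0.8)   # illustrative complex amplitude
    omega0 = 3.0             # illustrative natural frequency
    for t in (0.0, 0.2, 1.7, 5.0):
        z = x_from_exponentials(A, omega0, t)
        print(f"t={t}: exponentials {z.real:+.6f} (imag {z.imag:+.1e}), "
              f"cosine {x_from_cosine(A, omega0, t):+.6f}")
    # Pulled by 9 and released from rest: A = 4.5 gives x(0) = 9.
    print(x_from_exponentials(4.5 + 0j, omega0, 0.0).real)
```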

Suppose you had chosen to use $\sin\omega_0 t$ and $\cos\omega_0 t$ as the two basic 
solutions instead of $e^{\pm i\omega_0 t}$. The general solution would have been 


$$ x(t) = A\cos\omega_0 t + B\sin\omega_0 t \tag{17.55} $$


where A and B are arbitrary. However, demanding that x be real will force 
them both to be real. No matter how you slice it, a physical oscillator 
will have in its solution just two free parameters: they could be two real 
numbers A and B as above or one complex number $A = |A|e^{i\phi}$ as before. 

Now, this is a long and difficult way to get back the old answer. Your 
reaction may be, “We don’t need these complex numbers. We have enough 
problems in life; we’re doing well with sines and cosines, thank you.” But 
now I’m going to give you a problem where you cannot talk your way out 
by just turning it into a word problem. 


17.4 Exponential functions as generic solutions 


Here is the problem: a mass m, connected to a spring of force constant k, 
is moving on a surface with friction. The minute there is friction, you have 
an extra force. We know that if you’re moving to the right, the force of 
friction is to the left, and, if you are moving to the left, the force is to the 
right, that is, the frictional force is velocity dependent. The equation that 
crudely models this velocity dependence is 


$$ m\ddot{x} = -kx - \gamma m\dot{x} \tag{17.56} $$


where I include a factor m in the frictional coefficient to simplify subsequent 
algebra. Dividing by m, our equation becomes 


$$ \ddot{x} + \gamma\dot{x} + \omega_0^2 x = 0. \tag{17.57} $$


Can you solve this as a word problem? It’s going to be difficult, because 
you want a function that, when you take two derivatives, add some 
amount of its own derivative, and then some of itself, gives zero. It is not 
clear a trigonometric function can do that. However, an exponential has 
to work because it reproduces itself no matter how many derivatives you 
take. Thus we make the ansatz 


$$ x(t) = Ae^{\alpha t}. \tag{17.58} $$


Note that I do not explicitly use a complex exponential. If $\alpha$ is meant to 
be complex, it will come out that way; we are not forcing it to be real in 
making this ansatz. When we feed it into Eqn. 17.57 we find, because every 
derivative brings a factor of $\alpha$, 


$$ A(\alpha^2 + \gamma\alpha + \omega_0^2)e^{\alpha t} = 0. \tag{17.59} $$

Once again, A cannot be the cause of the zero, because if A vanishes you’ve 
killed the whole solution, and $e^{\alpha t}$ is not going to vanish, so the only way is 
for the stuff in brackets to vanish: 


$$ \alpha^2 + \gamma\alpha + \omega_0^2 = 0. \tag{17.60} $$


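That means the $\alpha$ that you put into this guess must be one of the 
roots 


$$ \alpha_\pm = \frac{-\gamma \pm \sqrt{\gamma^2 - 4\omega_0^2}}{2} = -\frac{\gamma}{2} \pm \sqrt{\frac{\gamma^2}{4} - \omega_0^2}. \tag{17.61} $$


The general solution is 


$$ x(t) = Ae^{\alpha_+ t} + Be^{\alpha_- t} \tag{17.62} $$
$$ = A\exp\left[\left(-\frac{\gamma}{2} + \sqrt{\frac{\gamma^2}{4} - \omega_0^2}\right)t\right] + B\exp\left[\left(-\frac{\gamma}{2} - \sqrt{\frac{\gamma^2}{4} - \omega_0^2}\right)t\right]. \tag{17.63} $$


The motion described by the solution depends on the value of $\gamma$ relative to $\omega_0$. 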


17.5 Damped oscillations: a classification 


Let us classify the different kinds of behavior that emerge as we vary $\gamma$ relative to $\omega_0$. 


17.5.1 Over-damped oscillations 


We first consider the over-damped case 


$$ \frac{\gamma}{2} > \omega_0. \tag{17.64} $$

In this case both roots $\alpha_\pm$ are real and both are negative: $\alpha_-$ is negative 
being a sum of two negative numbers, while $\alpha_+$ is negative because the 
positive square root is smaller than $\gamma/2$. This means that $x(t \to \infty) \to 0$, 
which is in accord with our expectation that friction will eventually bring 
the oscillations to an end. 

How about A and B? First of all, they are both real as can be seen by 
equating x(t) to its conjugate. Because the exponentials are real they do 
not respond to conjugation and we require $A = A^*$ and $B = B^*$. 

To find A and B, we need two pieces of data, which I will take to 
be initial position, x(0), and the initial velocity, v(0). If we put t = 0 in 
Eqn. 17.62 we find 


$$ x(0) = A + B. \tag{17.65} $$

Next I take the derivative of Eqn. 17.63 and then set t = 0 to find 

$$ v(0) = A\alpha_+ + B\alpha_-. \tag{17.66} $$


Solving these simultaneous equations will yield A and B. To test yourself, 
try showing that if the oscillator is displaced to some x(0) > 0 and released 
from rest, that is, v(0) = 0, then x(t) never becomes 0 and hence cannot 
become negative. This means the mass will simply relax to its equilibrium 
position without any oscillations. 
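
As a numerical companion to this exercise (not a substitute for the proof), here is a minimal Python sketch that solves Eqns. 17.65 and 17.66 for A and B and samples x(t). The function name `overdamped_solution` and the sample values $\gamma = 5$, $\omega_0 = 2$ are assumptions chosen for illustration; with them the roots come out as $-1$ and $-4$.

```python
import math


def overdamped_solution(gamma, omega0, x0, v0):
    """Return (A, B, x) for the over-damped case gamma > 2*omega0.

    A and B solve x(0) = A + B and v(0) = A*alpha_plus + B*alpha_minus,
    and x is the function t -> A e^{alpha_plus t} + B e^{alpha_minus t}.
    """
    if gamma <= 2 * omega0:
        raise ValueError("over-damped case requires gamma > 2*omega0")
    root = math.sqrt(gamma ** 2 / 4 - omega0 ** 2)
    alpha_plus = -gamma / 2 + root
    alpha_minus = -gamma / 2 - root
    A = (v0 - alpha_minus * x0) / (alpha_plus - alpha_minus)
    B = x0 - A

    def x(t):
        return A * math.exp(alpha_plus * t) + B * math.exp(alpha_minus * t)

    return A, B, x


if __name__ == "__main__":
    # Displaced to x(0) = 1 and released from rest.
    A, B, x = overdamped_solution(gamma=5.0, omega0=2.0, x0=1.0, v0=0.0)
    print(f"A = {A:.4f}, B = {B:.4f}")
    for t in (0.0, 0.5, 1.0, 2.0, 5.0):
        print(f"x({t}) = {x(t):.6f}")
```

In this example x(t) stays positive at every sampled time and simply decays, in line with the claim that the mass relaxes without oscillating. A sampled check like this is only suggestive; the exercise asks you to show it for all t.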


17.5.2 Under-damped oscillations 


In turning on friction we got carried away: from being 0 in the very first 
example, $\gamma$ jumped to a value greater than $2\omega_0$. Consider now the intermediate 
case when $0 < \gamma < 2\omega_0$. What do the solutions look like now? We 
should be able to guess that, at least for very tiny values of $\gamma$, the oscillator 
will oscillate as before, but with a slowly diminishing amplitude. Let us 
verify and quantify this expectation. 

The roots now become 


$$ \alpha_\pm = -\frac{\gamma}{2} \pm \sqrt{\frac{\gamma^2}{4} - \omega_0^2} \tag{17.67} $$
$$ = -\frac{\gamma}{2} \pm i\sqrt{\omega_0^2 - \frac{\gamma^2}{4}} \tag{17.68} $$
$$ \equiv -\frac{\gamma}{2} \pm i\omega'. \tag{17.69} $$

We have introduced yet another frequency 


$$ \omega' = \sqrt{\omega_0^2 - \frac{\gamma^2}{4}} < \omega_0, \tag{17.70} $$


which describes the oscillatory part of the motion. Note that the roots are 
complex conjugates 


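$$ \alpha_+ = \alpha_-^* \tag{17.71} $$

and the general solution becomes 

$$ x(t) = Ae^{\alpha_+ t} + Be^{\alpha_- t} \tag{17.72} $$
$$ = e^{-\frac{1}{2}\gamma t}\left[Ae^{i\omega' t} + Be^{-i\omega' t}\right]. \tag{17.73} $$


I leave it to you to verify that once again $x = x^*$ implies $A^* = B$ because the 
A and B terms get exchanged under complex conjugation. Repeating the 
analysis used for the case $\gamma = 0$, this solution may be rewritten as 


$$ x(t) = Ce^{-\frac{1}{2}\gamma t}\cos[\omega' t + \phi] \quad \text{where} \tag{17.74} $$
$$ C = 2|A| \quad \text{and} \quad A = |A|e^{i\phi}. \tag{17.75} $$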


Figure 17.4 shows what the damped oscillation looks like for A = 2, 
$\gamma = 1$, and $\omega' = 2\pi$. This is typically what you will see if you excite any 
system with some modest amount of frictional loss. If $\gamma$ is very small, you 
may not realize the oscillations are being damped. 


Figure 17.4 Damped oscillations with $x(t) = 4e^{-\frac{1}{2}t}\cos(2\pi t)$, i.e., A = 2, $\gamma = 1$, 
and $\omega' = 2\pi$. The falling exponential shows the decay of the amplitude. 


17.5.3 Critically damped oscillations 


Having considered the cases $\gamma > 2\omega_0$ (over-damped) and $\gamma < 2\omega_0$ (under-damped), 
we turn to the critically damped case $\gamma = 2\omega_0$. In this case 
$\alpha_+ = \alpha_- = -\gamma/2$. Where is the second solution to accompany $Ae^{-\frac{1}{2}\gamma t}$? We know in 
every problem there must be two solutions, because we should be able to 
pick the initial position and velocity at will. That’s an area of mathematics 
I don’t want to enter now, but you can verify that the second solution is 
$Bte^{-\frac{1}{2}\gamma t}$, which is not a pure exponential. You will find the derivation of this 
solution in my math book. The general solution for the critically damped 
case is thus 


$$ x(t) = e^{-\frac{1}{2}\gamma t}[A + Bt]. \tag{17.76} $$


Try to show in this case that A = x(0) and $B = v(0) + \frac{\gamma}{2}x(0)$. 
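
The three cases can be collected in a few lines of Python. This is a sketch under stated assumptions: the function names are illustrative, the roots are computed with the standard-library `cmath.sqrt` so that the same formula (Eqn. 17.61) covers real and complex roots, and the classification compares floating-point numbers exactly, which is adequate for an illustration but not for noisy input.

```python
import cmath
import math


def damping_roots(gamma: float, omega0: float):
    """Both roots of alpha^2 + gamma*alpha + omega0^2 = 0 (Eqn. 17.61)."""
    disc = cmath.sqrt(gamma ** 2 - 4 * omega0 ** 2)
    return (-gamma + disc) / 2, (-gamma - disc) / 2


def classify(gamma: float, omega0: float) -> str:
    """Name the regime by comparing gamma with 2*omega0."""
    if gamma > 2 * omega0:
        return "over-damped"
    if gamma < 2 * omega0:
        return "under-damped"
    return "critically damped"


def critical_solution(gamma: float, x0: float, v0: float):
    """Eqn. 17.76 with A = x(0) and B = v(0) + (gamma/2) x(0)."""
    A = x0
    B = v0 + 0.5 * gamma * x0
    return lambda t: math.exp(-0.5 * gamma * t) * (A + B * t)


if __name__ == "__main__":
    omega0 = 2.0
    for gamma in (5.0, 1.0, 4.0):
        a_plus, a_minus = damping_roots(gamma, omega0)
        print(f"gamma={gamma}: {classify(gamma, omega0)}, "
              f"roots {a_plus:.4f} and {a_minus:.4f}")
```

In the under-damped case the two roots come out as complex conjugates with imaginary part $\pm\omega'$; in the critical case they coincide at $-\gamma/2$.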


17.6 Driven oscillator 


Next we turn to a more challenging problem. I have, as before, the mass, 
the spring, and friction. But now I’m going to apply an extra force, 
$F_0\cos\omega t$. This is called a driven oscillator. Imagine that I am actively shaking 
the mass with my hand, exerting the force $F_0\cos\omega t$. Now there are 
three $\omega$’s: $\omega_0 = \sqrt{k/m}$, the natural frequency of the undamped free oscillator; 
$\omega' = \sqrt{\omega_0^2 - \gamma^2/4}$, which entered the under-damped oscillator; and finally $\omega$, 
the frequency of the driving force, which is completely up to me to choose. 
The equation to solve is 


$$ m\ddot{x} + \gamma m\dot{x} + kx = F_0\cos\omega t, \tag{17.77} $$

which we rewrite as 

$$ \ddot{x} + \gamma\dot{x} + \omega_0^2 x = \frac{F_0}{m}\cos\omega t. \tag{17.78} $$


This problem is difficult because you cannot guess the answer to it by 
turning it into a word problem: neither $x(t) \propto \cos(\omega t)$ nor $x(t) \propto \sin(\omega t)$ is 
a good ansatz because you cannot have all four terms be functions of the 
same kind as the ansatz. In fact, only an exponential can lead to all four 
terms being the same functional form (exponential) because taking any 
number of derivatives will leave it alone. But our driving force is a cosine 
and not an exponential. 

Here is a clever trick to beat this problem. Recall that with no driving 
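force, if 


$$ \ddot{x}_1 + \gamma\dot{x}_1 + \omega_0^2 x_1 = 0 \quad \text{and} \tag{17.79} $$
$$ \ddot{x}_2 + \gamma\dot{x}_2 + \omega_0^2 x_2 = 0, \tag{17.80} $$


then multiplying the first by a constant A and the second by a constant B 
and adding, we found that $Ax_1 + Bx_2$ was also a solution: 


$$ \frac{d^2[Ax_1 + Bx_2]}{dt^2} + \gamma\frac{d[Ax_1 + Bx_2]}{dt} + \omega_0^2[Ax_1 + Bx_2] = 0. \tag{17.81} $$

I have used the fact that the derivative of a linear combination $Ax_1 + Bx_2$ 
is the same linear combination of the derivatives. 

Suppose now that there is a driving force behind $x_1$ and $x_2$: 


$$ \ddot{x}_1 + \gamma\dot{x}_1 + \omega_0^2 x_1 = \frac{F_1(t)}{m} \tag{17.82} $$
$$ \ddot{x}_2 + \gamma\dot{x}_2 + \omega_0^2 x_2 = \frac{F_2(t)}{m}. \tag{17.83} $$

It follows by the same manipulations that 

$$ \frac{d^2[Ax_1 + Bx_2]}{dt^2} + \gamma\frac{d[Ax_1 + Bx_2]}{dt} + \omega_0^2[Ax_1 + Bx_2] = A\frac{F_1(t)}{m} + B\frac{F_2(t)}{m}. \tag{17.84} $$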


In other words, in a linear equation, the response to a linear combination of 
forces is the corresponding linear combination of responses. 

Now for the trick. Let x(t) be the solution to 

$$ \ddot{x} + \gamma\dot{x} + \omega_0^2 x = \frac{F_0}{m}\cos\omega t \tag{17.85} $$

and y(t) the solution to 

$$ \ddot{y} + \gamma\dot{y} + \omega_0^2 y = \frac{F_0}{m}\sin\omega t. \tag{17.86} $$


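(We could call these two solutions $x_1$ and $x_2$, but there is a reason for this 
nomenclature.) Multiply the second equation by i and add it to the first to 
obtain 


$$ \frac{d^2[x + iy]}{dt^2} + \gamma\frac{d[x + iy]}{dt} + \omega_0^2[x + iy] = \frac{F_0}{m}(\cos\omega t + i\sin\omega t) = \frac{F_0}{m}e^{i\omega t} \tag{17.87} $$
$$ \ddot{z} + \gamma\dot{z} + \omega_0^2 z = \frac{F_0}{m}e^{i\omega t} \quad \text{where} \tag{17.88} $$
$$ z(t) = x(t) + iy(t). \tag{17.89} $$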


This is a special case of Eqn. 17.84 with A = 1 and B = i. 

So, in Eqn. 17.88 I have manufactured a problem in which the thing 
that’s vibrating is not a real number, but z = x + iy. The force driving it is 
also not a real number; it is $\frac{F_0}{m}e^{i\omega t}$. The point is that if I can solve the problem 
somehow, I can get x(t) as the real part of the answer. (The imaginary part 
of it, y(t), will be the solution to the fictitious Eqn. 17.86 I concocted.) 

And I can solve Eqn. 17.88 for z(t) very easily because I can now 
make the ansatz 


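$$ z(t) = z_0 e^{i\omega t}. \tag{17.90} $$


Because every derivative pulls out an $i\omega$ we have 


$$ \left[-\omega^2 + i\omega\gamma + \omega_0^2\right]z_0 e^{i\omega t} = \frac{F_0}{m}e^{i\omega t}. \tag{17.91} $$

We may safely cancel $e^{i\omega t}$ because it is not identically zero to obtain the 
equation for $z_0$: 


$$ z_0 = \frac{F_0/m}{\left[-\omega^2 + i\omega\gamma + \omega_0^2\right]} \tag{17.92} $$
$$ = \frac{F_0/m}{Z(\omega)} \quad \text{where we have defined} \tag{17.93} $$
$$ Z(\omega) = \left[-\omega^2 + i\omega\gamma + \omega_0^2\right]. \tag{17.94} $$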


The magic of the exponential is that the differential equation 17.88 has 
reduced to an algebraic equation for the (complex) amplitude $z_0$, 


$$ Z(\omega)z_0 = \frac{F_0}{m}, \tag{17.95} $$


which is solved by dividing both sides by $Z(\omega)$: 


$$ z_0 = \frac{F_0/m}{Z(\omega)}. \tag{17.96} $$

It follows that 

$$ z(t) = \frac{(F_0/m)\,e^{i\omega t}}{Z(\omega)}. \tag{17.97} $$


All we need to do now is take the real part to get x(t). If you thought 
that this means replacing $e^{i\omega t}$ by $\cos\omega t$ you are wrong, because 


$$ Z(\omega) = \left[-\omega^2 + i\omega\gamma + \omega_0^2\right] \tag{17.98} $$

is itself a complex number whose real and imaginary parts can mix with 
the real and imaginary parts of $e^{i\omega t}$. So here is the correct way to do this. 


Take $Z(\omega)$ in Cartesian form 


$$ Z(\omega) = \left[\omega_0^2 - \omega^2\right] + i\omega\gamma \tag{17.99} $$

and write it in polar form 


$$ Z(\omega) = |Z|e^{i\phi} \quad \text{where} \tag{17.100} $$
$$ |Z| = \sqrt{(\omega_0^2 - \omega^2)^2 + \omega^2\gamma^2} \tag{17.101} $$
$$ \phi = \tan^{-1}\left[\frac{\omega\gamma}{\omega_0^2 - \omega^2}\right]. \tag{17.102} $$


Figure 17.5 shows Z in the complex plane. 
Return to Eqn. 17.97 with this result to obtain 


$$ z(t) = \frac{[F_0/m]\,e^{i\omega t}}{Z(\omega)} = \frac{[F_0/m]\,e^{i\omega t}}{|Z|e^{i\phi}} \tag{17.103} $$
$$ = \frac{F_0}{m|Z|}e^{i(\omega t - \phi)}. \tag{17.104} $$


Now we can take the real part easily because $\frac{F_0}{m|Z|}$ is real. Here is the final 
answer: 


$$ x(t) = \frac{F_0}{m|Z|}\cos(\omega t - \phi) \equiv x_0\cos(\omega t - \phi). \tag{17.105} $$


Figure 17.5 The complex number $Z(\omega)$ in its Cartesian and polar forms. 


Notice that the cause, $\frac{F_0}{m}\cos(\omega t)$, produces an effect that is reduced 
in magnitude by |Z| and shifted in phase into $\cos(\omega t - \phi)$. While there is a 
way to obtain both these transformations with real numbers, it is so much 
easier with complex numbers: dividing the force by a complex number 
$Z = |Z|e^{i\phi}$ achieves both these effects in one shot. Bear in mind that the 
phase $\phi$ cannot be eliminated by choice of the origin in time because it is 
the phase relative to that of the applied force $F_0\cos\omega t$. 
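
The recipe of Eqns. 17.97 to 17.105 translates almost line for line into Python's built-in complex arithmetic. What follows is a minimal sketch: the function names and the sample parameters are assumptions for illustration, and the phase is obtained with `cmath.phase`, which selects the quadrant from the signs of both the real and imaginary parts of Z. That matters when $\omega > \omega_0$, where the real part of Z is negative and a bare inverse tangent would be off by $\pi$.

```python
import cmath
import math


def impedance(omega, omega0, gamma):
    """Z(omega) of Eqn. 17.99 in Cartesian form."""
    return complex(omega0 ** 2 - omega ** 2, omega * gamma)


def steady_state(omega, omega0, gamma, f0_over_m):
    """Amplitude x0 = (F0/m)/|Z| and phase phi appearing in Eqn. 17.105."""
    Z = impedance(omega, omega0, gamma)
    return f0_over_m / abs(Z), cmath.phase(Z)


def x_via_complex(t, omega, omega0, gamma, f0_over_m):
    """Real part of z(t) = (F0/m) e^{i omega t} / Z(omega), Eqn. 17.97."""
    z = f0_over_m * cmath.exp(1j * omega * t) / impedance(omega, omega0, gamma)
    return z.real


def x_via_cosine(t, omega, omega0, gamma, f0_over_m):
    """x0 cos(omega t - phi), Eqn. 17.105."""
    x0, phi = steady_state(omega, omega0, gamma, f0_over_m)
    return x0 * math.cos(omega * t - phi)


if __name__ == "__main__":
    params = dict(omega=1.3, omega0=1.0, gamma=0.1, f0_over_m=1.0)
    x0, phi = steady_state(**params)
    print(f"x0 = {x0:.4f}, phi = {phi:.4f} rad")
    for t in (0.0, 0.5, 2.0):
        print(f"t={t}: {x_via_complex(t, **params):+.6f} "
              f"vs {x_via_cosine(t, **params):+.6f}")
```

The two routes agree at every time, while simply multiplying $\cos\omega t$ by $1/|Z|$ would miss the phase shift.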

Let us pause to analyze Eqn. 17.105. Keeping $F_0$ fixed, let us vary $\omega$, 
the frequency of the driving force, to see what happens to $x_0$, the amplitude 
of vibrations. When $\omega = 0$, that is, when the force does not vary with time, 
we find 


$$ |Z(0)| = \left.\sqrt{(\omega_0^2 - \omega^2)^2 + \omega^2\gamma^2}\,\right|_{\omega=0} = \omega_0^2 \tag{17.106} $$

so that 

$$ x_0 = \frac{F_0}{m\omega_0^2} = \frac{F_0}{k}, \tag{17.107} $$


which makes sense: a constant force $F_0$ will produce a displacement $F_0/k$. 

When $\omega \to \infty$, we find $x_0 \to 0$. Somewhere in between these extremes, 
the response peaks. It is clear that if $\gamma$ is very small, we get the 
biggest response when $\omega = \omega_0$: this is when |Z| is the smallest. This is 
called resonance. It tells us that the response of the system to a driving 
force is greatest when the driving frequency equals the natural frequency. 
Imagine you are pushing a kid on a swing, by periodically supplying the 
force. If you are not paying attention and pushing at your own frequency, 
sometimes you will slow the kid and sometimes you will speed up the kid. 
It is best to push exactly when the kid is moving away from you. Note that 
in a real swing $\gamma > 0$, and there is no danger of the kid flying off to infinity. 

Radios exploit the phenomenon of resonance. Right now this room is 
filled with electromagnetic signals from many stations, and yet you are able 
to listen to the one you want. The trick is that you can adjust the natural 
frequency of the electrical circuits picking up the signal by turning the dial 
to match that of the station of interest. For this plan to succeed, you need 
the graph in Figure 17.6 to be extremely sharp. Imagine that there are just 
two stations at two frequencies. Even if you tune the radio to resonate with 
one, you will be getting a tiny response from the tail of the other one. The 
goal is to keep this interference to a minimum. 
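
To get a feel for the resonance curve, here is a short Python sketch that scans the driving frequency and locates the peak of $x_0(\omega) = (F_0/m)/|Z(\omega)|$. The values $\omega_0 = 1$ and $\gamma = 0.1$ are those quoted for Figure 17.6; the choice $F_0/m = 1$, the frequency grid, and the function name `amplitude` are assumptions made for illustration.

```python
import math


def amplitude(omega, omega0=1.0, gamma=0.1, f0_over_m=1.0):
    """x0(omega) = (F0/m) / |Z(omega)|, using Eqn. 17.101 for |Z|."""
    return f0_over_m / math.hypot(omega0 ** 2 - omega ** 2, omega * gamma)


if __name__ == "__main__":
    # Scan omega from 0 to 3 in steps of 0.001 and pick the largest response.
    omegas = [0.001 * i for i in range(3001)]
    peak = max(omegas, key=amplitude)
    print(f"static response x0(0)      = {amplitude(0.0):.4f}")
    print(f"response at omega = omega0 = {amplitude(1.0):.4f}")
    print(f"peak near omega = {peak:.3f}, x0 = {amplitude(peak):.4f}")
    print(f"response at omega = 3      = {amplitude(3.0):.4f}")
```

With these numbers the response at $\omega = \omega_0$ is ten times the static response, and the peak sits very slightly below $\omega_0$ because $\gamma$ is small but not zero. Making $\gamma$ smaller makes the peak taller and narrower, which is the sharpness a radio needs.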

Where are the free parameters in this problem? Everything seems determined 
in Eqn. 17.105. What if this solution does not agree with some 
initial condition, such as a specified x(0) or v(0)? The answer is that we 
can add to this solution (called the particular solution) any solution to the 
equation with F(t) = 0, referred to as the complementary solution and given 
in Eqn. 17.73. Thus the most general solution to the driven oscillator is 


$$ x(t) = \frac{F_0}{m|Z(\omega)|}\cos(\omega t - \phi) + e^{-\frac{1}{2}\gamma t}\left[Ae^{i\omega' t} + Be^{-i\omega' t}\right]. \tag{17.108} $$


Figure 17.6 The amplitude $x_0(\omega)$ for a system with $\omega_0 = 1$ and $\gamma = 0.1$ driven 
by an external force with [...] 


Even after adding this term x(t) satisfies Eqn. 17.78 because the 
added terms disappear when we compute the left-hand side. Another way 
to see this is to invoke superposition: consider the right-hand side of 
Eqn. 17.78 to be $\frac{F_0}{m}\cos(\omega t) + 0$ and add the response due to 0, which is 
the complementary function. The numbers A and B can once again be 
chosen to match the initial conditions, say the initial position and velocity. 
One may forget about the complementary function at large times, because 
it dies out exponentially. 

Finally, consider the force pushing the kid as described earlier. It is 
periodic but not simply the function $\cos\omega t$. (For example, the force on the 
kid acts only for a small part of each period, while the cosine is non-zero 
except twice in a period.) Amazingly, we can use the technique described 
above to find the response to any periodic force, not necessarily a simple 
cosine or oscillatory exponential function. This is thanks to the mathematician 
Joseph Fourier, who showed that any function F(t) with period T 
may be written as a sum of oscillating exponentials, with suitable periods, 
multiplied by suitable coefficients $F_n$: 


$$ F(t) = \sum_{n=-\infty}^{\infty} F_n e^{2\pi i n t/T} = \sum_{n=-\infty}^{\infty} F_n e^{i\omega_n t} \quad \text{where} \tag{17.109} $$
$$ \omega_n = \frac{2\pi n}{T}. \tag{17.110} $$


On the right-hand side of Eqn. 17.109, with $\omega_n$ given by Eqn. 17.110, we have a sum of forces 
with frequencies $\omega_n = \frac{2\pi n}{T}$. I state without proof that the coefficients are 
determined by the given F(t) as follows: 


$$ F_n = \frac{1}{T}\int_0^T F(t)e^{-i\omega_n t}\,dt. \tag{17.111} $$


We are done because we know the response $z_0(n)$ due to each oscillating 
term $F_n e^{i\omega_n t}$ in the sum, and by previous linearity arguments the total 
response is the corresponding sum over responses: 


$$ z(t) = \sum_{n=-\infty}^{\infty} z_0(n)e^{i\omega_n t} \quad \text{where} \tag{17.112} $$
$$ z_0(n) = \frac{F_n/m}{Z(\omega_n)}. \tag{17.113} $$

If the driving force F(t) is real, the z(t) above will automatically turn 
out to be real. If you want more practice using complex numbers, you are 
invited to read the following proof. 

First note that 


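$$ \omega_{-n} = \frac{2\pi(-n)}{T} = -\omega_n \tag{17.114} $$
$$ F_n^* = \frac{1}{T}\int_0^T F(t)e^{i\omega_n t}\,dt \tag{17.115} $$
$$ = \frac{1}{T}\int_0^T F(t)e^{-i(-\omega_n)t}\,dt = \frac{1}{T}\int_0^T F(t)e^{-i\omega_{(-n)}t}\,dt \tag{17.116} $$
$$ = F_{-n}. \tag{17.117} $$

Now pair the contributions from the terms in Eqn. 17.112 with any n and 
$-n$: 


$$ z_0(n)e^{i\omega_n t} + z_0(-n)e^{i\omega_{-n}t} = \frac{F_n/m}{Z(\omega_n)}e^{i\omega_n t} + \frac{F_{-n}/m}{Z(\omega_{-n})}e^{i\omega_{-n}t} \tag{17.118} $$
$$ = \frac{F_n/m}{Z(\omega_n)}e^{i\omega_n t} + \frac{F_n^*/m}{Z^*(\omega_n)}e^{-i\omega_n t}, \tag{17.119} $$


which is manifestly real, being a sum of something plus its conjugate. We 
have also used 


$$ Z(\omega_{(-n)}) = Z(-\omega_n) = Z^*(\omega_n), \tag{17.120} $$

because for any $\omega$ we have 


$$ Z(\omega) = \left[-\omega^2 + i\omega\gamma + \omega_0^2\right] \tag{17.121} $$
$$ Z^*(\omega) = \left[-\omega^2 - i\omega\gamma + \omega_0^2\right] = Z(-\omega). \tag{17.122} $$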
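
The whole Fourier argument can be tried out numerically. The sketch below makes several assumptions that are not in the text: the integral in Eqn. 17.111 is approximated by a simple rectangle sum, the infinite sum in Eqn. 17.112 is cut off at $|n| \le$ `n_max`, and the periodic push (`pulsed_push`, a constant force during a short part of each period) is a hypothetical example with made-up numbers. All function names are illustrative.

```python
import cmath
import math


def fourier_coefficient(force, period, n, samples=2000):
    """Rectangle-rule estimate of F_n = (1/T) int_0^T F(t) e^{-i w_n t} dt."""
    omega_n = 2 * math.pi * n / period
    dt = period / samples
    total = 0j
    for k in range(samples):
        t = k * dt
        total += force(t) * cmath.exp(-1j * omega_n * t)
    return total * dt / period


def impedance(omega, omega0, gamma):
    """Z(omega) = -omega^2 + i omega gamma + omega0^2 (Eqn. 17.121)."""
    return complex(omega0 ** 2 - omega ** 2, omega * gamma)


def response(t, force, period, omega0, gamma, mass=1.0, n_max=20):
    """Truncated form of Eqn. 17.112: sum of z0(n) e^{i w_n t} for |n| <= n_max."""
    total = 0j
    for n in range(-n_max, n_max + 1):
        omega_n = 2 * math.pi * n / period
        f_n = fourier_coefficient(force, period, n)
        z0 = (f_n / mass) / impedance(omega_n, omega0, gamma)
        total += z0 * cmath.exp(1j * omega_n * t)
    return total


def pulsed_push(t, period=2.0, width=0.2):
    """Hypothetical periodic push: constant for a short part of each period."""
    return 1.0 if (t % period) < width else 0.0


if __name__ == "__main__":
    f3 = fourier_coefficient(pulsed_push, 2.0, 3)
    f_minus3 = fourier_coefficient(pulsed_push, 2.0, -3)
    print("F_3 =", f3, " F_-3 =", f_minus3)   # conjugates of each other
    z = response(0.37, pulsed_push, 2.0, omega0=1.0, gamma=0.1, n_max=10)
    print("z(0.37) =", z)                     # imaginary part is rounding noise
```

For a real force the coefficients with n and $-n$ come out as complex conjugates, and the summed response has an imaginary part at the level of rounding error, as the proof above says it must.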


CHAPTER 18 


Waves I 


We are moving to another topic: waves. Everyone has a good intuitive feel- 
ing for waves. Suppose you drop some object in a lake and you see ripples 
traveling outward from the center. If you keep your eye level with the water 
you will find these ups and downs going outward. This is why one says a 
wave is some displacement of a medium. That’s not a perfect definition be- 
cause electromagnetic waves travel in a vacuum. For this course you should 
imagine waves as what happens when you excite a medium. Once you have 
the example of water waves, you can latch onto that example every time I 
say “wave.” 

Here’s the way to think about the wave. You understand the harmonic 
oscillator pretty well, right? A mass and spring system in equilibrium will 
sit there forever. If you give it a kick, it will start vibrating around its equi- 
librium position. There you have to keep track of only one variable, x(t), 
which is the location of the mass at time t. The wave is an oscillation of 
an entire medium, and that means that at every point in space (where 
there is some medium) there is something that’s ready to oscillate. You 
have then a system with an infinite number of degrees of freedom because 
the height of the water at each point is an independent variable. If you 
don’t do anything to the water, it will stay at the height, but, if you fiddle 
with it or drop something, it will start vibrating. I’m going to use the 
symbol $\psi(x,t)$ to denote the change in the height of the water from its 
undisturbed value at the point x at time t, or more generally, the displacement 
of the medium from its equilibrium configuration. Unlike the case of 
the single oscillator, x is not the dynamical variable here. Instead it labels 
the points in the medium, and $\psi(x,t)$ is the dynamical variable: it is the 
thing jumping up and down. For simplicity I am considering waves in one 
dimension; in general I will have to invoke $\psi(x,y,z,t)$. 

The wave travels with some velocity v. If you drop a rock in a lake, you 
see when the waves get to the shore, you measure the distance to the shore 
divided by the time taken, and that’s the velocity. If someone lights a fire- 
cracker, you can find the velocity of the sound by determining how far away 
you were from the firecracker and how long it took for the sound to reach 
you. That’s the velocity of sound in air. Unlike the velocity of light, which 
doesn’t depend on anything, all other wave velocities depend on various 
conditions. The velocity of sound in air depends on the temperature, for 
example. Sound can also travel in a solid. You can take an iron rod, which 
you know is made up of a lot of atoms, and hit it with a hammer. You ba- 
sically compress these atoms and they in turn compress the atoms next to 
them, and the shock wave travels through this rod; that’s also called sound. 
That travels at a speed much faster than the speed of sound in air. Thun- 
der and lightning are the most famous examples of different velocities for 
different phenomena: the light gets to you first, and then the sound. 

Waves can be longitudinal or transverse, as shown in Figure 18.1. 

When I talk to you, this is what happens. The air in this room is at 
some constant pressure; the pressure is pushing your eardrum from the 
outside and also transmitted through your Eustachian canal and push- 
ing the drum from the inside. The two pressures balance and you don’t 
feel anything. As I talk, my vocal cords move back and forth and they 
increase and decrease the air pressure. These pressure waves propagate 
through the air to your ear. They hit your eardrum, which feels the chang- 
ing pressure outside, relative to the fixed pressure inside. So the eardrum 
goes back and forth in response. Behind the eardrum are the three little 
bones that transmit the motion of the eardrum to a fluid inside. There 
are little hairs shaking in this fluid, sending signals to your brain, which 
in turn tries to figure out what I am saying. That’s a long and impressive 
chain of transducers of sound energy to your brain. I draw the line at the 
eardrum; I don’t want to go on the other side. But what I do know is that 
a sound wave is a longitudinal wave: the back-and-forth motion of air is 
in the same direction as the sound signal; both are along the line joining 
you to me. In a longitudinal wave like sound, the motion of the medium is 
in the same direction as the motion of the wave. 


Figure 18.1 Top: A sound wave is longitudinal: the sound goes from left to right 
and the air also vibrates back and forth. Bottom: The wave on a string is 
transverse: the wave goes from left to right, but the string moves up and down. 

This is not always the case: In a transverse wave the medium moves 
perpendicular to the wave velocity. Take a string, clamp it down at the 
right end, pull it tight to give it some tension, and then give it a little flick 
at the left end. The blip you create will travel along the string, but the 
displacement of the string (the blip) is perpendicular to the velocity, as 
shown in Figure 18.1. This is a transverse wave. 

Do not confuse the motion of the medium with the motion of the 
signal. The string, at any given point, is either at rest or going up and 
down, but the signal is going from left to right. Even in the case of sound, 
when I talk to you, the air molecules may wiggle back and forth in the di- 
rection of the velocity, but their average position does not change; the 
molecule hitting your ear is not the one that was next to my mouth. So, the 
medium doesn’t actually propagate along with the wave. The game called 
Telephone illustrates this: kids line up and each kid tells the next kid some 
secret; the secret is eventually transmitted all the way to the end. What has 
traveled is the secret. The kid at the head of the line has not physically 
moved to the other end. Each kid hears something from the neighbor on 
one side and passes it on to the neighbor on the other side. Then each goes 
back to doing nothing. That’s what happens in the string as the blip trav- 
els. A section of the string that is not doing anything initially jumps up and 
down for a while when the blip passes through. Then it settles down and 
the section next to it starts moving. 


18.1 The wave equation 


There are many, many waves: water waves, electromagnetic waves, elastic 
waves, and so on. I want to consider one concrete example of a wave so you 
can develop a feeling for how to handle them. I’m going to discuss waves 
on a string. 

Imagine a string that’s been clamped at two ends (x = 0 and x = L in 
Figure 18.2). The thin horizontal line is the x-axis, and that is the string’s 
position in static equilibrium. Each point on the string is labeled by the 
value of x that it will have when the string is in the equilibrium position. 
The displacement of the string at the point labeled x at time t is denoted 
y (x,t), and this is our new dynamical variable. It is the one for which we 
would like to write the equations of motion. 


[Figure 18.2 artwork. Recoverable labels: $\theta(x+dx)$ at the right end of the highlighted segment and x = L at the right clamp.]

Figure 18.2 The string under tension T has mass $\mu$ per unit length and is
fixed at x = 0 and x = L. The highlighted segment has a width dx, with the same
tension T pulling the two ends but at slightly different angles. The displacement
y and angles $\theta$ are exaggerated for clarity. The derivation is valid only when all
these are very small.

The string is under some tension T. That means you hang some
weights at the ends or you tighten it with some screws, as in a violin. The
tension means that the string is dying to break apart if you cut it. Without
the tension, none of what follows would work, as you will see. Another essential
parameter here is $\mu$, the mass per unit length. To find it you put the
string on a weighing scale, you find the mass, and you divide by the length.
For example, if you have a ten-meter string and it weighs one-hundredth
of a kilogram, then the mass per unit length is $\mu = 10^{-3}$ kilograms per
meter.

Now, I pull or pluck this string in some way, given by the solid curve 
y(x,0) in the figure, and I want to know what the whole string will do. 
Compare this to the mass and spring system. There you pull the mass out 
to some new location x(0), you let it go, and you want to know x(t). There 
was just one degree of freedom, the location of the mass, x(t). The answer 
was $x(t) = x(0)\cos\omega t$. Here, at every point x between 0 and L, I have some
segment of the string. The displacement of each segment from equilibrium
is a degree of freedom, y(x). I displace all those infinitely many degrees of freedom
to y(x,0) at time t = 0, and I let them go. I want to know y(x,t). For this we
need to find the equation satisfied by y(x,t).

What authority will decide the behavior of this string? Newton’s law 
is the answer. There are no new laws that I’m going to invoke. I’m not
going to say, “We studied masses and springs before; today it’s time to study 
strings, and here is the new law of motion.” There’s only one law of motion. 
That’s F = ma. My whole purpose is to show you that this law really does 
control everything; that’s why it’s a super law. 

The string is a long, extended, and complicated object. I isolate a tiny 
segment of length dx highlighted in the figure. I am going to calculate the 
total force on it and equate it to its mass times acceleration. Gravity is not 
necessary for vibrations, and we will neglect its effect. 

The figure shows the two forces on the little segment. Both equal the 
tension T, which doesn’t change from point to point in magnitude. But the 
angle at which the tension acts is not necessarily the same. It is tangent to 
the string, and the direction of the tangent (measured from the horizontal)
is changing from $\theta(x)$ to $\theta(x+dx)$. The string is curving in general;
therefore, the tangents to the string at the two ends of the tiny bit are not 
quite the same, and there is generally a net force on the bit. 

So, I’m going to find out the vertical component of the two forces and
take the difference. The upward force at x + dx will be $T\sin\theta(x+dx)$,
and the downward force on the left side will be $-T\sin\theta(x)$, yielding a
total of $T[\sin\theta(x+dx) - \sin\theta(x)]$. That’s going to be mass times
acceleration. The mass of this little segment is the mass per unit length $\mu$
times the length of the segment, which is dx. Now, what is the acceleration
in the language of calculus? No, it is not $\ddot{x}$ but $\frac{\partial^2 y(x,t)}{\partial t^2}$, because y(x,t) is
the vertical coordinate of the string bit. What’s jumping up and down is y,
so the acceleration is its second derivative, and I use the partial derivative
because y(x,t) can vary with x and t. So F = ma becomes

$$T\,[\sin\theta(x+dx) - \sin\theta(x)] = \mu\,dx\,\frac{\partial^2 y(x,t)}{\partial t^2}. \qquad (18.1)$$


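Now, come to the left-hand side and assume the angles involved are
very small, that is, that the string does not deviate too much from being
horizontal. If you remember the series

$$\sin\theta = \theta - \frac{\theta^3}{3!} + \dots \qquad (18.2)$$
$$\cos\theta = 1 - \frac{\theta^2}{2!} + \dots \qquad (18.3)$$
$$\tan\theta = \frac{\sin\theta}{\cos\theta} = \frac{\theta - \frac{\theta^3}{3!} + \dots}{1 - \frac{\theta^2}{2!} + \dots} = \left(\theta - \frac{\theta^3}{3!} + \dots\right)\left(1 + \frac{\theta^2}{2!} + \dots\right) = \theta + \dots \qquad (18.4)$$

and keep only terms up to order $\theta$, you may then approximate as follows:

$$\sin\theta \simeq \theta \simeq \tan\theta = \frac{\partial y}{\partial x}. \qquad (18.5)$$
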
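Equation 18.1 becomes

$$T\left[\left.\frac{\partial y}{\partial x}\right|_{x+dx} - \left.\frac{\partial y}{\partial x}\right|_{x}\right] = \mu\,dx\,\frac{\partial^2 y(x,t)}{\partial t^2}. \qquad (18.6)$$

Dividing both sides by T and dx and letting $dx \to 0$, we finally obtain the
wave equation

$$\frac{\partial^2 y(x,t)}{\partial x^2} = \frac{\mu}{T}\frac{\partial^2 y(x,t)}{\partial t^2}. \qquad (18.7)$$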

This is a partial differential equation. It is usually rewritten as

$$\frac{\partial^2 y(x,t)}{\partial x^2} = \frac{1}{v^2}\frac{\partial^2 y(x,t)}{\partial t^2} \qquad (18.8)$$

where

$$v = \sqrt{\frac{T}{\mu}}. \qquad (18.9)$$


You should verify that v has dimensions of velocity. It will turn out 
to be the velocity of waves on the string. 
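
As a quick numerical illustration of Eqn. 18.9, here is a minimal Python sketch. It reuses the ten-meter string with a mass of one-hundredth of a kilogram from above; the tension of 10 N and the function name `wave_speed` are assumptions made only for this example.

```python
import math

def wave_speed(tension, mass_per_length):
    """Speed of transverse waves on a string, v = sqrt(T / mu) (Eqn. 18.9)."""
    if tension <= 0 or mass_per_length <= 0:
        raise ValueError("tension and mass per unit length must be positive")
    return math.sqrt(tension / mass_per_length)

# The string from the text: 10 m long with a mass of 0.01 kg.
mu = 0.01 / 10.0          # mass per unit length in kg/m
T = 10.0                  # tension in newtons (assumed value for illustration)

v = wave_speed(T, mu)
print(f"mu = {mu:.0e} kg/m, T = {T} N, v = {v:.1f} m/s")
```

Since N/(kg/m) = m^2/s^2, the square root indeed has units of velocity. Quadrupling the tension doubles v, while quadrupling the mass per unit length halves it.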

In summary, when you pull a string up, it comes down because the 
tensions at the two ends of the string bit have vertical components that 
don’t quite cancel. So, the net force depends on the rate of change of
$\sin\theta \simeq \tan\theta = \frac{\partial y(x,t)}{\partial x}$, that is, the rate of change of the rate of change, and that’s
why you get $\frac{\partial^2 y(x,t)}{\partial x^2}$ on the left-hand side. The second time-derivative on
the right is just the acceleration of the string bit. 

Two questions are usually asked at this point. From the figure we see 
that dx is actually the horizontal projection of the string bit. Shouldn't we 
use its full length in finding the mass of the bit? Shouldn’t we worry that 
because different parts of the string have stretched by different amounts 
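the mass density $\mu$ is really $\mu(x)$? We do not worry, because of the small
angle approximation. By Pythagoras’s theorem the length of the string bit
is (see the figure)

$$dl = \sqrt{(dx)^2 + (dy)^2} = dx\sqrt{1 + \left[\frac{\partial y}{\partial x}\right]^2} \qquad (18.10)$$
$$\simeq dx\left(1 + \frac{1}{2}\left[\frac{\partial y}{\partial x}\right]^2\right) \qquad (18.11)$$
$$= dx\left(1 + \frac{1}{2}\tan^2\theta\right) \simeq dx \qquad (18.12)$$

because $\tan^2\theta$ is at least quadratic in $\theta$. Thus in this approximation every
part of the string is stretched in the horizontal direction by the same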
amount as before it was plucked. That is why the tension has a constant 
magnitude T along its length. 

The second question is why we do not worry about the difference in 
the horizontal components of T at the two ends. The answer is that these 
components are proportional to $\cos\theta = 1 - \frac{\theta^2}{2!} + \dots \simeq 1$ if we stop at first
order in $\theta$.

Equation 18.8 is ubiquitous and describes any elastic medium dis- 
turbed a little bit from equilibrium. The small disturbances y obey such 
an equation and v is the velocity of their propagation. 


18.2 Solutions of the wave equation 


What are the consequences of this wave equation? Why is v the velocity of 
the wave? To find out, let us go from the finite string to an infinite medium, 
like an infinite lake. 

To proceed we need to guess a solution to this equation. In the case 
of the oscillator we said, “I have an equation: the second derivative of x is 
equal to some number times x. The answer is $\cos\omega t$ or $\sin\omega t$.” But now
we have the second derivatives with respect to space and second derivatives 
with respect to time. I know that this y is going to oscillate up and down 
in space and time. To break down the problem, imagine a water wave that 
is traveling; the ripples are traveling. If you took a snapshot of the wave at 
a given time, it would go up and down in space. Or, if you stood at one 
point in the water and let the wave go past you, the water would go up 
and down in time. So the wave oscillates in space and time, and we have to 
guess the answer. I could tell you how to deduce the answer, but I do not 
have room here. I’m just going to write down a solution and verify that it 
obeys the wave equation, and then I will analyze it. Now there are many, 
many solutions to the equation. The solution I’m going to write down is 


$$y(x,t) = A\cos(kx - \omega t). \qquad (18.13)$$


Beware! This k is not a force constant. It is a new symbol called the 
wave number. We know that $\omega$ must have units of inverse time and k must
have dimensions of inverse length in order that the argument of the cosine 
is dimensionless. 

Let’s verify that Eqn. 18.13 is a solution to the wave equation. Consider
$\frac{\partial^2 y}{\partial x^2}$. When you take a partial derivative with respect to x it says forget
about time; treat x as the only variable. So

$$\frac{\partial^2 y}{\partial x^2} = -Ak^2\cos(kx - \omega t). \qquad (18.14)$$

Likewise

$$\frac{\partial^2 y}{\partial t^2} = -A\omega^2\cos(kx - \omega t). \qquad (18.15)$$

The wave equation

$$\frac{\partial^2 y}{\partial x^2} = \frac{1}{v^2}\frac{\partial^2 y}{\partial t^2} \quad\text{then demands that} \qquad (18.16)$$
$$-Ak^2\cos(kx - \omega t) = -\frac{\omega^2}{v^2}A\cos(kx - \omega t). \qquad (18.17)$$


We cancel A and the $\cos(kx - \omega t)$ because neither is identically zero.
This means that a solution of the form Eqn. 18.13 exists for any amplitude
A. (Of course, A cannot be anything. The calculation assumed $\sin\theta \simeq \tan\theta$
and $\cos\theta \simeq 1$. So, once you make an approximation and you get an answer,
you should not blindly apply the answer to circumstances where
the equation itself is not valid. Even though the answer says you can have 
any A, in practice you shouldn't use it for an A for which the small angle 
approximation fails.) 

Canceling $A\cos(kx - \omega t)$ from both sides of Eqn. 18.17, we find
a solution of the assumed form exists provided the parameters $\omega$ and k
satisfy

$$k = \pm\frac{\omega}{v}. \qquad (18.18)$$

We will always choose $\omega$ to be positive, and the two signs of k will be seen
to correspond to the two possible directions of propagation. 
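
The verification above can also be checked numerically. The sketch below is an illustration under stated assumptions, not part of the derivation: it estimates both second derivatives of the trial solution in Eqn. 18.13 by central differences and shows that the wave equation is satisfied only when the frequency equals k times v. The values of A, v, k, the step h, and the sample point are arbitrary choices.

```python
import math

def y(x, t, A, k, omega):
    """Trial solution y(x, t) = A cos(kx - omega t) of Eqn. 18.13."""
    return A * math.cos(k * x - omega * t)

def residual(x, t, A, k, omega, v, h=1e-4):
    """d2y/dx2 - (1/v^2) d2y/dt2, estimated with central differences."""
    d2x = (y(x + h, t, A, k, omega) - 2 * y(x, t, A, k, omega)
           + y(x - h, t, A, k, omega)) / h**2
    d2t = (y(x, t + h, A, k, omega) - 2 * y(x, t, A, k, omega)
           + y(x, t - h, A, k, omega)) / h**2
    return d2x - d2t / v**2

A, v, k = 0.5, 3.0, 2.0          # arbitrary illustrative values
x0, t0 = 0.7, 0.3                # arbitrary sample point

good = residual(x0, t0, A, k, omega=k * v, v=v)        # omega = k v
bad = residual(x0, t0, A, k, omega=2 * k * v, v=v)     # omega != k v

print(f"residual with omega = kv : {good:.2e}")
print(f"residual with omega = 2kv: {bad:.2e}")
```

The first residual is zero up to discretization and rounding error, while the second is of order one, which is the numerical counterpart of the condition in Eqn. 18.18.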
Now let us write the solution that respects the condition Eqn. 18.18,
considering first the case $k = \frac{\omega}{v}$:

$$y(x,t) = A\cos(kx - kvt) = A\cos[k(x - vt)]. \qquad (18.19)$$


This equation says that y (x, t), which could have depended on x and 
t separately, depends on them only through this combination x — vt. In 
other words, x and t do not separately determine y (x, t); only this combi- 
nation x — vt does. If the combination has a certain value, y has a certain 
value. If I change x and t keeping x — vt constant, the function doesn’t 
change. I will now argue that this really means it’s a wave that’s traveling 
at a velocity v. 




Let’s take the bell-shaped function

$$f(x) = Ae^{-x^2}. \qquad (18.20)$$

It peaks at x = 0. Take the same function with x replaced by $x - vt$. Where
does this function

$$f(x,t) = Ae^{-(x-vt)^2} \qquad (18.21)$$


have its maximum? It has a maximum when x = vt. So, if you followed this 
f(x, t) =f (x— vt) as a function of time, the peak that was at x = 0 at t = 0 
moves to x = vt at time t: the shape moves undistorted at a velocity v. 
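
To make the moving peak concrete, here is a small sketch that samples the bell-shaped pulse with x replaced by x - vt on a grid and locates its maximum at several times. The grid spacing, the amplitude, the speed v = 2, and the chosen times are assumptions made for illustration.

```python
import math

def pulse(x, t, A=1.0, v=2.0):
    """Bell-shaped pulse f(x - vt) = A exp(-(x - vt)^2)."""
    return A * math.exp(-(x - v * t) ** 2)

def peak_position(t, v=2.0, x_min=-5.0, x_max=15.0, dx=0.01):
    """Grid point where the pulse is largest at time t."""
    n = int(round((x_max - x_min) / dx)) + 1
    xs = [x_min + i * dx for i in range(n)]
    return max(xs, key=lambda x: pulse(x, t, v=v))

v = 2.0
peaks = {t: peak_position(t, v=v) for t in (0.0, 1.0, 2.5)}
for t, x_peak in peaks.items():
    print(f"t = {t}: peak at x = {x_peak:.2f} (vt = {v * t:.2f})")
```

Within the grid resolution the peak sits at x = vt at every time, so the shape translates at velocity v without distortion.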

Returning to our wave, it just looks like Acos kx at t = 0, and we all 
know it has a maximum or crest at x = 0. If you wait a bit, where is this 
maximum? If you increase time a little bit by dt, you have to increase x by 
dx such that the two changes k dx and $\omega\,dt$ cancel, keeping the argument of
the cosine at zero. But that means that

$$\frac{dx}{dt} = \frac{\omega}{k} = v. \qquad (18.22)$$


That means the crest is moving at the velocity v. 

It follows that $y[k(x - vt)]$ describes a wave traveling to the right at
speed v. If you want a wave traveling to the left, you should pick $k = -\frac{\omega}{v}$,
and the solution will be of the form $y[k(x + vt)]$.

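Calculus buffs can verify that any function $f(x - vt)$ (not just the
cosine) satisfies the wave equation. To see this, use the chain rule: if
$w = x - vt$ then f = f(w) and

$$\frac{\partial f}{\partial x} = \frac{df(w)}{dw}\frac{\partial w}{\partial x} = \frac{df(w)}{dw}\cdot 1 \qquad (18.23)$$
$$\frac{\partial^2 f}{\partial x^2} = \frac{d^2 f(w)}{dw^2}\cdot 1^2 \qquad (18.24)$$
$$\frac{\partial f}{\partial t} = \frac{df(w)}{dw}\frac{\partial w}{\partial t} = \frac{df(w)}{dw}\cdot(-v) \qquad (18.25)$$
$$\frac{\partial^2 f}{\partial t^2} = \frac{d^2 f(w)}{dw^2}\cdot(-v)^2 \quad\text{so that finally} \qquad (18.26)$$
$$\frac{1}{v^2}\frac{\partial^2 f}{\partial t^2} = \frac{d^2 f}{dw^2} = \frac{\partial^2 f}{\partial x^2} \qquad (18.27)$$

from which it follows that not only $f(x - vt)$ but also $f(x + vt)$ satisfies the
wave equation. Even

$$y(x,t) = Ae^{-(x-vt)^2} \qquad (18.28)$$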


obeys the wave equation, even though it does not readily come to your 
mind when you think of a wave, the way the cosine does. 

Note that every wave in a given medium has a definite velocity, but 
only plane waves have a definite wave number and frequency. In particular 
the wave in Eqn. 18.28 has a velocity v but does not repeat in time or space. 
Once the peak zips past you, it is all over. 


18.3 Frequency and period 


We now ask, “What is k and what is $\omega$ in $A\cos(kx - \omega t)$?” We are all good
at visualizing a function of one variable, but this is a function of two variables.
Let us first understand this function at a fixed time, say t = 0. At
t = 0, it looks like $A\cos kx$, which starts out at A at x = 0 and finishes a full
cycle when we move a distance x, such that $kx = 2\pi$. This distance is by
definition the wavelength $\lambda$ and obeys $k\lambda = 2\pi$. Thus k, the wave number,
is related to the wavelength $\lambda$ that we understand more intuitively by the
formula

$$\lambda = \frac{2\pi}{k} \quad\text{or}\quad k = \frac{2\pi}{\lambda}. \qquad (18.29)$$

Figure 18.3 shows the plot for A = 2, k = 1 ($\lambda = 2\pi \simeq 6.28$).

Now, you can ask if I would have obtained a different relation between
k and $\lambda$ had I picked some other time than t = 0. Think about it:
$\omega t_0$ is some angle $\phi_0$ inside the cosine; it’s just going to shift the whole pattern
by some amount. Changing the phase won’t change the fact that the
peak-to-peak distance is $\lambda$.

[Figure 18.3 artwork. Plot labels: "crest" at the maxima and "trough" at the minima.]

Figure 18.3 The wave $A\cos(kx - \omega t)$ at t = 0 as a function of x for the choice
A = 2, k = 1. For k = 1, $\lambda$ should be $2\pi \simeq 6.28$ according to Eqn. 18.29, and
indeed it is. The maxima are called crests and the minima are called troughs.


Next we ask, “What does it look like to a person sitting at a certain 
location, as a function of time?” For example, the waves could be manufac- 
tured in the ocean and sent toward the shore. You stand at one place near 
the shore; the wave is going past you. You will bob up and down, and that’s 
what I want to describe mathematically. So, I pick some location x = 0 for 
convenience, and then $y(0,t) = A\cos\omega t$. I didn’t miss the minus sign; for
a cosine, it doesn’t matter if you change the angle to minus the angle. So, 
consider the same graph, this time as a function of time, and ask what hap- 
pens. You start with a maximum at t = 0, and as t increases, the function 
oscillates and reaches its next maximum, a crest at a time T called the time
period such that

$$\omega T = 2\pi \quad\text{or}\quad \omega = \frac{2\pi}{T} = 2\pi f \qquad (18.30)$$

where f is the frequency. This relation between $\omega$ and T should be familiar
from the single oscillator. The minima are called troughs. 
So, if you like, you can write the plane wave as

$$y(x,t) = A\cos\left(\frac{2\pi x}{\lambda} - \frac{2\pi t}{T}\right). \qquad (18.31)$$

It’s strictly equivalent to $A\cos(kx - \omega t)$, and you may write it either
way. Equation 18.31 makes it more obvious that when x changes by $\lambda$ or t
changes by T, nothing happens to the cosine because you are changing the
argument by $2\pi$.
Let us now rewrite the relation $\omega = kv$ in terms of $\lambda$ and $f = \frac{1}{T}$:

$$v = \frac{\omega}{k} = \frac{2\pi f}{2\pi/\lambda} = \lambda f, \qquad (18.32)$$

which can be understood as follows. Suppose I am at the origin and hold 
an infinitely long string. When I start wiggling my end with frequency f, 
the pulses will travel to the right. Let’s wait one second. In one second, I 
would have manufactured f of these full cycles, and each has length $\lambda$. Thus
the wavefront would have advanced to $x = \lambda f$ after one second, which is
the velocity by definition. I am making $\lambda$-sized objects, f per second, and
pushing them out, so the front of the wave advances a distance $\lambda f$ in one
second. 

Figure 18.4 shows the wave varying in both x and t for the choice
A = k = 1 and $\omega = 2$ for x and t in the interval $[0, 4\pi]$. If you mentally
slice it at fixed x or t, shown by the dotted lines, you can count two periods 
in x and four in t. 
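
The relations in Eqns. 18.29, 18.30, and 18.32 are easy to tabulate. The sketch below uses the values quoted for Figure 18.4 (k = 1 and an angular frequency of 2) and counts the cycles that fit in the plotted interval; the helper name `wave_parameters` is hypothetical.

```python
import math

def wave_parameters(k, omega):
    """Return wavelength, period, frequency, and speed for A cos(kx - omega t)."""
    wavelength = 2 * math.pi / k        # Eqn. 18.29
    period = 2 * math.pi / omega        # Eqn. 18.30
    frequency = 1 / period
    speed = wavelength * frequency      # Eqn. 18.32, equal to omega / k
    return wavelength, period, frequency, speed

k, omega = 1.0, 2.0                     # values used for Figure 18.4
wavelength, period, frequency, speed = wave_parameters(k, omega)

interval = 4 * math.pi                  # both x and t run over [0, 4 pi]
print(f"wavelength = {wavelength:.3f}, period = {period:.3f}")
print(f"speed = {speed:.3f} (omega/k = {omega / k:.3f})")
print(f"cycles in x: {interval / wavelength:.0f}, cycles in t: {interval / period:.0f}")
```

The two cycle counts reproduce the "two periods in x and four in t" read off the figure.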


[Figure 18.4 artwork: the wave plotted over both x and t.]

Figure 18.4 The wave $A\cos(kx - \omega t)$ with A = 1, k = 1, $\omega = 2$, or $\lambda = 2\pi$, $T = \pi$,
in the range $[0, 4\pi]$ for both x and t. If you slice it at fixed t, you will see two full
periods in x (see dotted line at $t = 2\pi$), and if you slice it at fixed x, you will see
four periods in t (see dotted line at $x = 2\pi$). The lines track the maxima or crests,
but one could pick any point within the cycle.


CHAPTER 19 

19.1 Wave energy and power transmitted


It is obvious that a vibrating string has more energy than a string that 
is not vibrating. I want to calculate the energy in a string vibrating with 
displacement 


$$y(x,t) = A\cos(kx - \omega t). \qquad (19.1)$$


Now, if it’s an infinitely long string, the energy in it is infinite, so you define 
the energy per unit length. Take a portion of the string, associated with a 
segment of width dx, and ask, “How much energy does it have?” 

The energy has a kinetic part and a potential part. The kinetic part is
simple:

$$dK = \frac{1}{2}\mu\,dx\left[\frac{\partial y}{\partial t}\right]^2. \qquad (19.2)$$

This is just half the mass of the segment times the square of the velocity.
For the solution in Eqn. 19.1 this becomes

$$dK = \frac{1}{2}\mu\,dx\,A^2\omega^2\sin^2(kx - \omega t). \qquad (19.3)$$

The potential energy dU has a more subtle origin. The string is under
tension T. When the string is displaced from equilibrium, a segment
associated with the interval dx has a length (see Figure 18.2)

$$dl = \sqrt{(dx)^2 + (dy)^2} = dx\sqrt{1 + \left[\frac{\partial y}{\partial x}\right]^2} \simeq dx\left\{1 + \frac{1}{2}\left[\frac{\partial y}{\partial x}\right]^2\right\}. \qquad (19.4)$$

Thus it has expanded by an amount

$$\delta l = dl - dx = \frac{1}{2}\left[\frac{\partial y}{\partial x}\right]^2 dx \qquad (19.5)$$

against a tension T. So the work pumped in and stored as potential
energy is

$$dU = T\,\delta l = \frac{1}{2}T\left[\frac{\partial y}{\partial x}\right]^2 dx. \qquad (19.6)$$

For the solution in Eqn. 19.1 this becomes

$$dU = \frac{1}{2}Tk^2A^2\sin^2(kx - \omega t)\,dx = \frac{1}{2}\mu A^2\omega^2\sin^2(kx - \omega t)\,dx = dK \qquad (19.7)$$

upon using

$$Tk^2 = T\frac{\omega^2}{v^2} = T\omega^2\frac{\mu}{T} = \mu\omega^2. \qquad (19.8)$$


Notice that the kinetic and potential energies are equal at every value 
of x and t. In particular, when the string bit is maximally displaced, it has 
no velocity and hence no dK. It also has no dU because the slope is zero 
and so is the stretch $\delta l$. When the displacement is zero, it has the greatest
velocity and slope and hence the greatest dK and dU. This is unlike the 
harmonic oscillator, where due to the conservation of energy, K and U 
add up to a constant, and when one is large the other is small. This is not a 
problem for the string bit because it is not an isolated system, and energy
is pumped into (or out of) it by its neighbors. 

The total energy is

$$dE = dK + dU = 2\,dK = \mu\,dx\,A^2\omega^2\sin^2(kx - \omega t). \qquad (19.9)$$

The energy per unit length or energy density u is then

$$u(x,t) = \mu A^2\omega^2\sin^2(kx - \omega t). \qquad (19.10)$$

Of interest is the average energy density $\bar{u}$, where the average is over a
full cycle in space (a wavelength) or a full cycle in time (a full period T).
The average of $\sin^2$ is $\frac{1}{2}$ in either case, and we have the final result

$$\bar{u} = \frac{1}{2}\mu A^2\omega^2. \qquad (19.11)$$


What is the average power sent into this wave by the person exciting 
the medium? Imagine that the string is tied to the point at infinity and 
that I have been wiggling it from x = 0. Let us say at this time the waves 
have gone out to some point x = L. The string is inert beyond that point. 
After one more second, an extra segment of length v has begun vibrating. 
What is the average energy contained in the extra segment? It is v times the 
average energy per unit length, $\frac{1}{2}\mu A^2\omega^2$. So the average energy I pump in
per second, or the average power, is

$$\bar{P} = \frac{1}{2}\mu A^2\omega^2 v = \bar{u}v. \qquad (19.12)$$
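
As a numerical check of Eqns. 19.10 to 19.12, the sketch below averages the energy density over one wavelength with a simple midpoint sum and compares the result with the closed form. All parameter values (mass per unit length, tension, amplitude, angular frequency) are assumptions chosen only for illustration.

```python
import math

mu = 1e-3        # mass per unit length, kg/m (assumed)
T = 10.0         # tension, N (assumed)
A = 0.002        # amplitude, m (assumed, small compared with the wavelength)
omega = 200.0    # angular frequency, rad/s (assumed)

v = math.sqrt(T / mu)            # wave speed, Eqn. 18.9
k = omega / v                    # wave number from omega = k v
wavelength = 2 * math.pi / k

def energy_density(x, t):
    """u(x, t) = mu A^2 omega^2 sin^2(kx - omega t), Eqn. 19.10."""
    return mu * A**2 * omega**2 * math.sin(k * x - omega * t) ** 2

# Midpoint-rule average of u over one wavelength at a fixed time.
n = 1000
t0 = 0.0
u_avg_numeric = sum(
    energy_density((i + 0.5) * wavelength / n, t0) for i in range(n)
) / n

u_avg_formula = 0.5 * mu * A**2 * omega**2    # Eqn. 19.11
power = u_avg_formula * v                     # Eqn. 19.12

print(f"average energy density: {u_avg_numeric:.3e} J/m (formula {u_avg_formula:.3e})")
print(f"average power: {power:.3e} W")
```

Because the power scales as the square of both the amplitude and the frequency, doubling either one quadruples the power the source must supply.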


A special property of life in one dimension is that this wave goes 
undiminished in amplitude. You can go 10 miles or 100 miles from me, 
and the amplitude is still the A that I produced at the source, because all 
the energy goes along this line. There is no escape or spreading out. What 
happens in three dimensions is more typical. If you have a very tall tower 
on top of which you put a speaker, the energy radiates out in concentric 
spheres as time goes by. The power at the source is now spread over bigger 
and bigger spheres as you go farther out. In three dimensions we have the 
notion of intensity, which is the power per unit area. If there is a speaker 
sending out sound waves, and I take a one-meter by one-meter window 
and hold it in front of me and ask how much power crosses it on average,
that is the intensity I in $\mathrm{W/m^2}$. Of course, the window doesn’t have to be
one square meter big. You can take a tiny square, provided you divide the 
power by its area to find I. You don’t have that notion in one dimension 
because you cannot hold a window in one dimension perpendicular to the 
velocity. 

If you are r meters from the source of sound in three dimensions, the 
power P is spread over a sphere of area $4\pi r^2$ and the intensity is

$$I = \frac{P}{4\pi r^2}\ \mathrm{W/m^2}. \qquad (19.13)$$


This is true also for electromagnetic waves. 
The intensity of sound, $\beta$, is measured in decibels (dB) defined as
follows:

$$\beta = 10\log_{10}\frac{I}{I_0}\ \mathrm{dB} \qquad (19.14)$$

where $I_0 = 10^{-12}\ \mathrm{W/m^2}$ is the reference intensity, the minimum an average
human ear can detect. Thus the minimum intensity we can hear is zero dB 
according to this formula. 

A whisper is 15 dB, while a rock concert or jet engine is around 120
dB. In a rock concert

$$10\log_{10}\frac{I}{I_0} = 120 \qquad (19.15)$$
$$\frac{I}{I_0} = 10^{12} \qquad (19.16)$$
$$I = I_0\cdot 10^{12} = 1\ \mathrm{W/m^2}. \qquad (19.17)$$


Due to the logarithmic scale, an increase in intensity by a factor of 
100,000 causes an increase of 50 dB. Our sense of loudness seems to grow 
as the log of the intensity rather than the intensity itself. We need the log- 
arithmic scale because the ratio of the largest intensity we can tolerate to 
the smallest we can hear is one trillion! 

The difference in decibels between two different intensities $I_1$ and $I_2$
is equal to $10\log_{10}\frac{I_1}{I_2}$.
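
The decibel arithmetic above can be reproduced in a few lines. This is a minimal sketch; the function names are hypothetical, and the source power and distance in the last example are assumed values, not figures from the text.

```python
import math

I0 = 1e-12   # reference intensity in W/m^2

def decibels(intensity):
    """Sound level 10 log10(I / I0) in dB, Eqn. 19.14."""
    return 10 * math.log10(intensity / I0)

def intensity_from_db(level_db):
    """Invert Eqn. 19.14 to recover the intensity in W/m^2."""
    return I0 * 10 ** (level_db / 10)

def intensity_at(power, r):
    """Intensity of a point source spread over a sphere, Eqn. 19.13."""
    return power / (4 * math.pi * r**2)

print(decibels(I0))                       # threshold of hearing
print(intensity_from_db(120))             # rock concert, in W/m^2
print(decibels(1e5 * I0) - decibels(I0))  # a factor of 100,000 in intensity

# Assumed example: a 1 W source heard from 10 m away.
print(decibels(intensity_at(1.0, 10.0)))
```

Because the scale is logarithmic, every factor of 10 in intensity adds 10 dB, and doubling the distance from a point source lowers the level by about 6 dB.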


19.2 Doppler effect


Now we will consider a totally different property of wave propagation that 
I’m going to illustrate with sound: the Doppler effect. This term refers to the 
well-known phenomenon that a source of definite frequency, like a siren 
on a fire truck, will be heard as a higher frequency when the fire truck is 
coming toward the observer and a lower one when it’s going away from the 
observer. We want to know why, and by how much, the frequency changes. 
The answer is based on $v = \lambda f$.

Let us take a source S in the left half of Figure 19.1 that is sit- 
ting still, emitting waves that spread out spherically. Three equally spaced 
crests are shown at a particular time. You are the observer O, standing 
off to the right listening to the sound. The waves go by you and you observe
a certain wavelength. That’s the distance from one crest to the next;
call it $\lambda_0$. (We need a subscript because soon another $\lambda$ will enter.) As
usual,

$$\lambda_0 f_0 = v \qquad (19.18)$$


[Figure 19.1 artwork. Panel titles: "Source at rest" (left) and "Source in motion" (right).]


Figure 19.1 At the left we have a source at rest and observer O at rest. The three 
crests emitted are equally spaced in the air and also as perceived by O. At the 
right we have the source moving at velocity u. The wavefronts now get crunched 
in the forward direction because the source travels a bit to the right between 
emitting one crest and the next. In the backward direction the opposite happens. 

where $f_0$ is the frequency. Now suppose the source is moving to the right at
a certain speed u as shown in the right half. It’s not hard to visualize that
the waves in the air will get squashed in the forward direction to the right:
having emitted one crest, the source moves to the right a little bit before
emitting the next crest. The new $\lambda$ will be $\lambda_0$ minus the distance the source
travels in a time $T = \frac{1}{f_0}$. Thus

$$\lambda = \lambda_0 - uT = \lambda_0 - \frac{u}{f_0}. \qquad (19.19)$$

What is the new frequency f that you will hear? It is determined by the
usual relation

$$\lambda f = v. \qquad (19.20)$$


You must understand that, even though the source is moving to the 
right, the velocity of sound is not altered by that process. A moving truck 
does not emit sound at an increased speed in the forward direction. The 
speed of sound with respect to a medium is controlled by the properties 
of the medium, not the velocity of the source. If the medium is air, the 
sound can travel only at a certain speed. On the other hand, if there is a 
machine gun on a moving truck that sprays bullets in all directions at the 
same speed in its frame, then according to a person on the ground, the 
ones in the forward direction would be moving faster than the ones in 
the backward direction. In any case, the new frequency is 


$$f = \frac{v}{\lambda} = \frac{v}{\lambda_0 - \frac{u}{f_0}} = \frac{v}{\frac{v}{f_0} - \frac{u}{f_0}} = \frac{f_0}{1 - \frac{u}{v}}. \qquad (19.21)$$

The observed frequency f is the normal frequency $f_0$ divided by
$1 - \frac{u}{v} < 1$. So the observed frequency will go up. If you want to see what
happens to the left of the source, you simply reverse the velocity of the
source (moving to the right) relative to the sound waves (now moving to
the left to meet the observer on the left) in the same formula. So the answer
covering both cases is

$$f = \frac{f_0}{1 \mp \frac{u}{v}} \qquad (19.22)$$

[source coming toward (away from) static observer].

Your common sense should tell you which sign applies when.

What happens with light and at relativistic speeds u? The velocity of 
light will be c in vacuum, but the frequency will change for many reasons. 
First of all, the ambulance clock slows down according to you by the stan- 
dard relativistic time dilatation, even if it’s not coming toward you or going 
away from you, but going around you, so that f < fo. This is called the 
transverse Doppler effect. There is, in addition, the more familiar Doppler 
effect due to source motion toward or away from you, which will reduce to 
Eqn. 19.22 when $\frac{u}{c} \ll 1$.

Now for sonic booms, which can be traced back to 


$$f = \frac{f_0}{1 - \frac{u}{v}}. \qquad (19.23)$$


Relativity was chock-full of formulas where the denominator could vanish 
but never did because c was the limiting speed. But the speed of sound v is 
not an upper limit at all, and we have planes that travel at u > v. As $u \to v$,
the crests are getting crunched more and more in the forward direction 
until u = v, and all the crests pile right on top of each other; that’s when 
a sonic boom occurs. If you could go faster than that, you would leave 
behind a trail of waves, with nothing in front of you. So, if there were a jet 
plane coming toward you faster than the speed of sound, you would not 
have time to get out of the way—the jet would hit you before the sound 
waves hit you. 

Now, here’s one more variant of the Doppler effect: the observer is 
moving toward the source, which is at rest. The siren (at rest in the air) 
is emitting nice spherical waves, but you are rushing to meet them with 
velocity $-u$. What’s the frequency at which you will see crests? In the frame
of reference of the air, the sound is moving to the right at speed v, and you
are moving to the left at speed u. So, you are zipping past the waves at
a speed u + v, and the crests are spaced at a distance $\lambda_0$ apart. (Nothing
happens to the spacing between them; they are just as they are in the left
half of Figure 19.1.) The frequency according to you will be the distance
you move relative to the crests in one second divided by the length of
each cycle:

$$f = \frac{u + v}{\lambda_0} = \frac{v}{\lambda_0}\left(1 + \frac{u}{v}\right) = f_0\left(1 + \frac{u}{v}\right). \qquad (19.24)$$

If you are running away from the source, you should change the sign. The
general formula is then

$$f = \frac{v}{\lambda_0}\left(1 \pm \frac{u}{v}\right) = f_0\left(1 \pm \frac{u}{v}\right) \qquad (19.25)$$


[observer moving toward (away from) static source]. 


So, when you move toward the source, the correction factor appears 
in the numerator; when the source moves toward you, the correction factor 
appears in the denominator. You have to use common sense in choosing 
the $\pm$ sign. For example, it’s obvious that if you’re rushing to meet the
waves you should get a higher frequency. You can go on and ask what hap- 
pens if you are moving and the ambulance is also moving. The fun never 
ends. 
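
The two results, Eqns. 19.22 and 19.25, can be collected in a short sketch. The function names, the sign convention (positive u means approach), and the numbers (a 440 Hz siren, a speed of 30 m/s, and a round 340 m/s for the speed of sound) are assumptions made for illustration.

```python
def moving_source(f0, u, v):
    """Frequency heard by a static observer, Eqn. 19.22.

    u > 0: source approaches the observer; u < 0: source recedes.
    """
    if u >= v:
        raise ValueError("formula requires source speed below the wave speed")
    return f0 / (1 - u / v)

def moving_observer(f0, u, v):
    """Frequency heard by a moving observer from a static source, Eqn. 19.25.

    u > 0: observer approaches the source; u < 0: observer recedes.
    """
    return f0 * (1 + u / v)

f0 = 440.0     # emitted frequency in Hz (assumed)
v = 340.0      # speed of sound in m/s (assumed round value)
u = 30.0       # speed of source or observer in m/s (assumed)

print(f"source approaching  : {moving_source(f0, u, v):.1f} Hz")
print(f"source receding     : {moving_source(f0, -u, v):.1f} Hz")
print(f"observer approaching: {moving_observer(f0, u, v):.1f} Hz")
print(f"observer receding   : {moving_observer(f0, -u, v):.1f} Hz")
```

For u much smaller than v the two formulas agree to first order in u/v, which is why the moving-source and moving-observer answers differ by only a few hertz here.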


19.3 Superposition of waves 


The rest of this chapter focuses on one essential property of waves called interference.
Look at the wave equation. It’s a linear equation. Don’t be fooled
by the 2’s in $\frac{\partial^2 y}{\partial x^2}$ or $\frac{\partial^2 y}{\partial t^2}$. That doesn’t make it quadratic; it’s still the first
power of y on both sides. Then verify the following in your mind: If $y_1$
and $y_2$ are solutions, so is $y_1 + y_2$. What that really means is the following.
Suppose you emit some sound and you’re sending some wave $y_1$ that
travels through space. Then, I turn you off and I turn on another speaker;
let’s say the speaker emits another sound described by $y_2$. If you and the
speaker emit sound at the same time, then the air disturbance will be simply
the sum $y_1 + y_2$. So, if one cause produces one wave and a second
cause produces a second wave, then, when both are turned on, the wave 
they produce will simply be the sum. This is the principle of superposition, 
and it follows from the linearity of the wave equation. From here until the 
end of the chapter we will analyze a variety of situations where we add two 
waves. 

The simplest problem is the following. Pick a certain spot, say x = 0,
and listen to two plane waves that are now functions of just t:
$y_1 = A\cos\omega_1 t$ and $y_2 = A\cos\omega_2 t$. I’ve chosen the two waves to have the same
amplitude, but not necessarily the same frequency. So, the signal you will
hear will be

$$y = y_1 + y_2 = A\cos\omega_1 t + A\cos\omega_2 t. \qquad (19.26)$$

Now, you can use the trigonometric identities and figure out what happens,
but I want you to think a little bit first. Think about these two waves.
If they have the same frequency, it’s trivial, right? If $\omega_1 = \omega_2 = \omega$, then
$y = 2A\cos\omega t$. That just means they reinforce each other. At every instant
you get double the y you got before. Say the frequencies are not equal.
Initially they are in step. This cosine is 1, and that cosine is 1, and they add
up to 2A. As time goes by, $\omega_1 t$ and $\omega_2 t$ start differing, and the cosines are
not in step anymore. After a while the two cosines are off by half a cycle,
or the angles inside them differ by an odd multiple of $\pi$. Then they will
completely cancel. If you wait longer they will again be in step and so on.
One way to think about it is to imagine two runners going around on a cir- 
cular track. They start out together, but they have slightly different speeds. 
As they go around, one starts lagging behind the other, and they will be on 
different parts of the circle. If you wait long enough, they will again line up 
at the start line but with one difference: the fast runner will have done an 
integer number of laps more than the other. 

Consider two frequencies $\omega_1 = 2$ and $\omega_2 = 3$, which are small enough
for us to follow the oscillations, as depicted in Figure 19.2. At t = 0 they
are in step (runners start off together), at $t = \pi$ their phases differ by $\pi$
(runners are on opposite sides of the track) and they cancel, and finally at
$t = 2\pi$ they are in step (the slow one has done two laps and the fast one
three). This repeats every $2\pi$ seconds.
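
The runner picture can be checked by evaluating the sum at the three times just mentioned. This is a minimal sketch with unit amplitude assumed; the function name `superposition` and the test time 0.37 are arbitrary choices.

```python
import math

def superposition(t, A=1.0, omega1=2.0, omega2=3.0):
    """y(t) = A cos(omega1 t) + A cos(omega2 t), Eqn. 19.26 at x = 0."""
    return A * math.cos(omega1 * t) + A * math.cos(omega2 * t)

for label, t in (("t = 0", 0.0), ("t = pi", math.pi), ("t = 2 pi", 2 * math.pi)):
    print(f"{label:9s} y = {superposition(t):+.3f}")

# The pattern repeats with period 2 pi: compare y(t) with y(t + 2 pi).
t_test = 0.37
repeats = abs(superposition(t_test) - superposition(t_test + 2 * math.pi)) < 1e-12
print("repeats after 2 pi:", repeats)
```

The sum is 2A when the two cosines are in step and vanishes when their phases differ by an odd multiple of pi, as described above.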


[Figure 19.2 artwork. Plot title: cos 2t + cos 3t.]

Figure 19.2 Superposition of two waves of unit amplitude A = 1 and angular
frequencies $\omega_1 = 2$ and $\omega_2 = 3$. At t = 0 they are in step, at $t = \pi$ their phases
differ by $\pi$ and they cancel, at $t = 2\pi$ they are in step, and this repeats every $2\pi$
seconds.

Next consider the case where $\omega_1$ and $\omega_2$ are large but differ by a tiny
amount. Now we are better off invoking

$$\cos A + \cos B = 2\cos\left[\tfrac{1}{2}(A + B)\right]\cos\left[\tfrac{1}{2}(A - B)\right]$$

in Eqn. 19.26 to find

$$y(t) = 2A\cos\left[\tfrac{1}{2}(\omega_1 - \omega_2)t\right]\cos\left[\tfrac{1}{2}(\omega_1 + \omega_2)t\right]. \tag{19.27}$$


Suppose for example $\omega_1 - \omega_2 = 2$ while $\omega_1 + \omega_2 = 2\cdot 10^6$. Then Eqn. 19.27
reads


$$y(t) = [2A\cos t]\cos\left[10^6 t\right]. \tag{19.28}$$


In the time it takes the first cosine to finish one cycle, the second 
has completed a million. Conversely, in the time it takes the second co- 
sine to finish 100 cycles, the first has hardly changed. Thus we can treat 
[2Acost] as the slowly varying amplitude for the second rapidly oscillat- 
ing cosine. The rise and fall of this amplitude is called the beat and can 
be picked up by the ear. Figure 19.3 shows what happens when $\omega_1 = 41$
rads/s and $\omega_2 = 39$ rads/s. Notice that the envelope of the rapid oscillations
is itself a cosine and that the time $T$ between two maxima is $\pi$ and
the beat frequency is $\omega_b = \frac{2\pi}{T} = 2 = \omega_1 - \omega_2$. You might have thought it
should be $\omega_b = \tfrac{1}{2}(\omega_1 - \omega_2)$, based on the argument inside the first cosine
in Eqn. 19.28. It is true that only after a time $2\pi$ the amplitude returns
to $2A$, and that after a time $T = \pi$ it returns to $-2A$. However, $-2A$ is
still an amplitude maximum, just as loud as $2A$. Alternatively, think of the
time period of the beats as the interval between two successive zeros of the
amplitude. These occur twice in every full cycle of $\cos\left[\tfrac{1}{2}(\omega_1 - \omega_2)t\right]$.
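
If you would rather let a computer check the trigonometry, here is a minimal Python sketch. It compares the direct sum of the two cosines of Figure 19.3 with the product form of Eqn. 19.27 and computes the time between loudness maxima. The function names and the sampling grid are illustrative choices, not anything fixed by the text.

```python
import math

def two_wave_sum(t, w1, w2, amplitude=1.0):
    """Direct superposition A cos(w1 t) + A cos(w2 t)."""
    return amplitude * (math.cos(w1 * t) + math.cos(w2 * t))

def envelope_times_carrier(t, w1, w2, amplitude=1.0):
    """Product form of Eqn. 19.27: slow envelope times rapid oscillation."""
    envelope = 2.0 * amplitude * math.cos(0.5 * (w1 - w2) * t)
    carrier = math.cos(0.5 * (w1 + w2) * t)
    return envelope * carrier

def beat_period(w1, w2):
    """Time between successive loudness maxima, T = 2 pi / |w1 - w2|."""
    return 2.0 * math.pi / abs(w1 - w2)

# Angular frequencies of Figure 19.3 (rad/s).
W1, W2 = 41.0, 39.0

# Illustrative sampling grid: 2000 points covering 0 <= t < 10 s.
times = [0.005 * i for i in range(2000)]
max_mismatch = max(
    abs(two_wave_sum(t, W1, W2) - envelope_times_carrier(t, W1, W2))
    for t in times
)

print("largest difference between the two forms:", max_mismatch)
print("beat period T:", beat_period(W1, W2))
```

The first number should sit at the level of rounding error, and the second should come out as $\pi$, the spacing between maxima quoted above.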

Piano tuners use beats as follows. They hit the tuning fork, and it 
vibrates at some prescribed frequency, say f = 440 Hz. Imagine that your 
piano is slightly off; maybe it’s at 438 Hz. The beat frequency will be 2 Hz.
The tuner will keep fiddling with the piano until the beat (not the total 
sound) disappears. 

[Plot: cos 39t + cos 41t]

Figure 19.3 Beats due to two waves of unit amplitude A = 1 and large but
nearly equal angular frequencies $\omega_1 = 41$, $\omega_2 = 39$. The (angular) beat frequency
is supposed to be $\omega_b = 41 - 39$ rads/s, which means a time $T = \frac{2\pi}{\omega_b} = \pi$ between
beats, in accord with the figure.


19.4 Interference: the double-slit experiment 


Now we turn to a more complicated problem where I’m going to add two 
waves. In the case of beats I sat at one location and added two waves com- 
ing to me as a function of time. Now I am going to see how two waves 
interfere at different points in space in what is called the double-slit exper- 
iment, depicted in Figure 19.4. It shows a huge tank of water as seen from 
the top. There is a vibrating source E at the left end that emits water waves. 
These wavefronts will be concentric circles near the source, but if you go 
far to the right of the source, you can treat the wavefronts as just parallel 
lines. The figure shows three crests and two troughs, the wavelength $\lambda$ and
amplitude A. Next these waves hit an impenetrable barrier with two slits in
it, labeled $S_1$ and $S_2$. These two slits will themselves start generating their
own waves that radiate out from them to the right. The figure shows a crest 
and trough from each using solid and dotted lines. 

Suppose you have a detector of water waves D (say a floating piece of 
cork skewered on a vertical rod), which you can place anywhere along the 
right side of the rectangular experimental region to measure the amplitude 
of water waves. What amplitude will it measure at different points? 

First take a point M exactly midway between these two slits, lying on 
the perpendicular bisector of the line joining the slits. What are you getting 
here? You are getting a signal from each slit. They will arrive in step or in 
phase, because they were both generated by the same plane wave from the 

[Figure: the double-slit setup seen from above. Recoverable labels: (Amplitude)² along the back wall.]

Figure 19.4 The double-slit experiment, from left to right. We are looking down
at a tank of water. Waves are emitted from E, far enough away that the crests and
troughs are parallel lines. Three crests (solid lines) and two troughs (dotted lines)
are shown. They hit the two slits $S_1$ and $S_2$, and the semicircular waves emerge
from the slits. A few maxima (and minima) are shown by solid (and dotted)
lines. The thin and thick notches on the lines marked $L_1$ and $L_2$ correspond to
maxima and minima along these two paths. The interference pattern at the far
right is measured by a detector D, which can slide along the back wall. The point
M is a maximum and X is a minimum. The maxima and minima measured by
the detector refer to the variation of the amplitude squared due to interference of
two waves, not to be confused with the oscillations of y itself, as shown to the left
of the slits along with its $\lambda$. The amplitude squared with just one slit open is
shown by a dark dotted line and is assumed to be essentially constant.


left that hit the two slits in phase and have traveled the same distance to 
reach M. So, when a crest from $S_1$ reaches M, a crest from $S_2$ will reach
M, and likewise for the troughs. Assuming each slit generates a wave of 
amplitude A, their sum at M will have an amplitude 2A. (We ignore the 
slight decrease in amplitude as the waves spread out from the slits.) So the 
water will be very choppy at M. If you shut off one slit, the amplitude will 
go back to just that from one hole (shown by the nearly constant dotted 
line). It makes sense that two open holes give you 2A. 

But consider the point X above M, where the figure shows a snap- 
shot of the two waves arriving from the two slits, with crests and troughs 
indicated by thick and thin markers. Let us denote the distance from the 
two slits to X by $L_1$ and $L_2$. Whereas at M we had $L_1 = L_2$ by symmetry,
that is not so at X. At X, the difference $L_1 - L_2 = \lambda/2$. As you know, between
every two crests there is a trough where y is minus the amplitude, unlike
at the crest where y is equal to the amplitude. So when $L_1 - L_2 = \lambda/2$, when
a crest from $S_2$ arrives, a trough from $S_1$ arrives, canceling it. Later, when
a trough from $S_2$ arrives, a crest from $S_1$ cancels it. In fact, at every instant,
whatever signal comes from $S_2$, minus that will come from $S_1$. This is the
case because a path difference of $\lambda/2$ corresponds to a phase difference of $\pi$
in the function $\cos(2\pi x/\lambda)$. The water will be completely still at X at all times.
The amplitude there will be zero. This point X is called an interference 
minimum, while M is an interference maximum. At M the interference is 
constructive, and at X it is destructive. 

That point X is very interesting. Look what the theory is telling you. 
It says that if you sit there, the water, which was bobbing up and down with 
just one slit open, won’t move at all when there are two slits open. It’s the
property of waves: two waves can lead to nothing, but one wave cannot— 
it has nothing to neutralize it. This is something you have to get straight. 
For example, if a hole in your mosquito net is letting mosquitoes in, it will 
not help to make another hole. Interference does not exist for particles like 
mosquitoes, because mosquitoes are never negative in number. 

Now, if I go further up the right side of the rectangle, I reach a point 
where the difference $L_1 - L_2$ is the full wavelength, and then I’m back to
being in step. Of course, a full wavelength means that the 13-th crest from
$S_1$ and the 14-th from $S_2$ arrive in step, but we don’t care, as long as a crest
comes with a crest; we have constructive interference. As we move up and 
down the back wall, the amplitude of the sum of the two waves oscillates, 
as indicated by the solid wiggly line, which has a maximum at M, a mini- 
mum at X, and so on. Remember that the maxima and minima measured 
by the detector refer to the variation of the amplitude (squared) due to in- 
terference of two waves, not to be confused with the oscillations of y itself, 
as shown to the left of the slits along with its $\lambda$. (The dotted line is the
amplitude squared due to just one slit, and it is roughly constant. At a point
where this amplitude is, say, 5 cm, the water goes up and down by 5 cm.) 

For those who want to go a little deeper, the semi-circular waves 
emanating from the slits have the form 


$$y_1(r,t) = A\cos(kr_1 - \omega t) \quad\text{and}\quad y_2(r,t) = A\cos(kr_2 - \omega t) \tag{19.29}$$

where $r_1$ and $r_2$ are the distances from the slits to the point r we are
interested in. (Actually, in two dimensions, A itself falls off with distance, but
that is a much smaller effect, which we ignore.) At any given time, the lines
of constant phase for $y_1$ and $y_2$ are semi-circles centered at $S_1$ and $S_2$
respectively, because the phase depends only on $r_1$ and $r_2$. Pick a point for
the detector. It has some value of $r_1 = L_1$ and $r_2 = L_2$. At any given time t
we can ignore the time-dependent phase $\omega t$ common to both waves. Their
phase difference is then simply $k(L_1 - L_2) = \frac{2\pi}{\lambda}(L_1 - L_2)$. It follows that
a path-length difference $m\lambda$, where $m = 0, \pm 1, \pm 2, \ldots$, causes a phase
difference of $2\pi m$ and hence constructive interference, while a difference of a
half-integer wavelength causes a phase difference equal to an odd multiple
of $\pi$ and hence destructive interference.
The exact condition for destructive and constructive interference is 


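$$L_1 - L_2 = m\lambda, \qquad m = 0, \pm 1, \pm 2, \ldots \quad\text{for constructive interference} \tag{19.30}$$

$$L_1 - L_2 = \left(m + \tfrac{1}{2}\right)\lambda, \qquad m = 0, \pm 1, \pm 2, \ldots \quad\text{for destructive interference.} \tag{19.31}$$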


To find the point of constructive or destructive interference is a mat- 
ter of simple geometry. For example, to locate the first point of destructive 
interference, you measure the distances from the slits, using Pythagoras’s 
theorem, take the difference, and set it equal to $\lambda/2$. However, we often
employ the following approximation for computing $L_2 - L_1$, when the back
wall and detector are at a distance far greater than $d$ and $\lambda$. Then the lines
going from the slits to the detector are nearly parallel, as shown in Figure
19.5. Say they leave at an angle $\theta$ from the forward direction to reach the
detector. Then you can see that the extra distance from the lower slit is
$d\sin\theta$, where $d$ is the distance between the slits. (The equality of the two $\theta$’s
in the figure should be familiar from the dark days of the inclined plane.) 
Thus we may set 


$$L_2 - L_1 = d\sin\theta \tag{19.32}$$


in Eqns. 19.30 and 19.31. Given the values of $d$ and $\lambda$, one can say at
what angle $\theta$ one will observe the first minimum and so forth. Conversely,
given $d$ and the angle $\theta$ at a maximum or minimum, one can deduce the
wavelength $\lambda$.

Figure 19.5 Computing the path difference in the double-slit experiment when
the detector is very far from the slits. The dotted lines are the troughs.
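
To get a feel for how good the far-field approximation of Eqn. 19.32 is, here is a small Python sketch. It computes the path difference exactly with Pythagoras's theorem, compares it with $d\sin\theta$, and then uses Eqn. 19.31 to go from $d$ and $\lambda$ to the angle of a minimum and back. The geometry (slits at heights $\pm d/2$, back wall at distance `wall_distance`) and all the numbers are hypothetical, chosen only for illustration.

```python
import math

def exact_path_difference(d, wall_distance, y):
    """L2 - L1 from Pythagoras, with slits at heights +d/2 and -d/2
    and the detector at height y on a wall a distance wall_distance away."""
    upper = math.hypot(wall_distance, y - d / 2.0)  # L1, from the upper slit
    lower = math.hypot(wall_distance, y + d / 2.0)  # L2, from the lower slit
    return lower - upper

def far_field_path_difference(d, wall_distance, y):
    """Eqn. 19.32: L2 - L1 = d sin(theta), theta measured from the midpoint."""
    theta = math.atan2(y, wall_distance)
    return d * math.sin(theta)

def minimum_angle(d, wavelength, m=0):
    """Angle of the m-th minimum: d sin(theta) = (m + 1/2) wavelength."""
    return math.asin((m + 0.5) * wavelength / d)

def wavelength_from_minimum(d, theta, m=0):
    """Invert the same condition to deduce the wavelength from d and theta."""
    return d * math.sin(theta) / (m + 0.5)

# Hypothetical numbers: slits 2 cm apart, wall 10 m away, detector 1 m up.
d, wall_distance, y = 0.02, 10.0, 1.0
exact = exact_path_difference(d, wall_distance, y)
approx = far_field_path_difference(d, wall_distance, y)
print("exact:", exact, "approximate:", approx)

# Hypothetical wavelength of 1 cm: find the first minimum, then recover it.
theta_first = minimum_angle(d, 0.01)
print("first minimum at (radians):", theta_first)
print("wavelength recovered:", wavelength_from_minimum(d, theta_first))
```

With the wall this far away the two path differences should agree to many digits, which is why the approximation is so commonly used.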

In the early nineteenth century it was not clear whether light was a 
wave or a stream of particles. Thomas Young performed a double-slit ex- 
periment with light that showed an interference pattern and thereby nailed 
it: it is impossible to reduce the number of particles coming to a point like 
X (to zero) by opening a second slit. He also managed to deduce the
wavelength from knowing $d$ and $\theta$, without knowing exactly what medium was
supporting light waves and what it was that was oscillating! 


19.5 Standing waves and musical instruments 


I take a string, attach one end to a wall, and start shaking the free end. Let’s 
say I shake it just once and send a pulse moving to the right as in part A of 
Figure 19.6. When this blip gets to the wall it finds the string cannot vibrate 
there. Thus the wall will exert a suitable force to ensure that the signal the 
wall sends out, along with the signal I send in, add up to zero there. The 
result of that is that the signal generated by the wall becomes the reflected 
wave, reversed in direction and sign as shown in part B. 

Here is another way to understand the reflected wave. Suppose there 
is no wall, and I live to the right of where it used to be. I see you send a 
blip to the right as in Part A of the figure, and I manufacture an inverted 
mirror image of that traveling toward the reflection point R, as shown by 



Figure 19.6 Part A: The blip I generate (up-and-down arrow at left) goes to the 
wall at the right (solid vertical line). Part B: The blip gets reflected from the wall 
at point R, reversed in direction and in sign. Part C: There is no wall, but a blip 
comes from the right and cancels the original blip at the wall for all times. The 
former keeps moving to the left and becomes the reflected blip of part B. 


the dotted curve in part C. I arrange it so that when the two blips cross, 
they exactly cancel at the putative wall location for all times. Then, they 
go through each other (by the superposition principle), and my pulse will 
cross over to the left as your reflected wave. Because there is no movement 
of the string at R under this combined influence, it means that even if I 
clamp the string there with a wall that doesn’t allow the string to move, it 
doesn’t change the outcome. That is how we convince ourselves that the 
incoming pulse gets reflected as described. 

Now I want to study, not a single incoming one-shot pulse, but an 
incoming periodic wave $y_i = A\cos(kx - \omega t)$. A continuous signal goes in,
and how does it come out? It comes out as


$$y_r(x,t) = -A\cos(-kx - \omega t) = -A\cos(kx + \omega t) \tag{19.33}$$


where I’ve chosen the wall to be at x = 0 and reversed the amplitude and 
the direction of motion of the incoming wave to get the reflected wave. I 
keep sending stuff to the wall, and the wall keeps reflecting that back to 
me. Both waves coexist in the region to the left of the wall to yield 


$$y(x,t) = y_i + y_r = 2A\sin[kx]\sin[\omega t] \quad\text{using} \tag{19.34}$$

$$\cos x - \cos y = 2\sin\left[\frac{x+y}{2}\right]\sin\left[\frac{y-x}{2}\right]. \tag{19.35}$$

So the string oscillates with a frequency $\omega$ everywhere, but the
amplitude at a point x is $2A\sin kx$. This is called a standing wave. Whenever
$kx = 0, -\pi, -2\pi, \ldots$ there is no vibration. (Remember x is negative to the
left of the wall.) Such points are called nodes. Exactly in between two nodes
at $kx = -\frac{\pi}{2}, -\frac{3\pi}{2}, \ldots$, et cetera, are anti-nodes where the amplitude has the
maximum of 2A. Figure 19.7 shows a standing wave on a string for the case
$A = 0.5$ and $k = 1$ or $\lambda = 2\pi$. The vibrating string is a blur, shown by the
shaded regions. Note that the distance between two nodes is $\lambda/2$.

If you grab a node with your fingers, it won't matter because the 
string was not planning to vibrate there anyway. You can also grab two 
nodes and it still will not matter. Using this fact, we can solve the following 
problem of a string that is clamped at both ends, x = 0 and x = L. What 
are the possible frequencies of vibration? To solve this, we first ask, “What 
are the possible standing waves?” The answer: the string length L has to be 
equal to an integer multiple $n$ of $\tfrac{1}{2}\lambda$. But the string length has already been
given to us as L, so it is the possible values of $\lambda_n$, or $k_n$, labeled by $n$, that
have to adjust themselves to obey 


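$$L = n\frac{\lambda_n}{2}, \quad n = 1, 2, 3, \ldots \quad\text{which means}\quad k_n = \frac{2\pi}{\lambda_n} = \frac{n\pi}{L} \tag{19.36}$$

$$\omega_n = k_n v = \frac{n\pi v}{L} \tag{19.37}$$

$$f_n = \frac{nv}{2L} = nf_1 \tag{19.38}$$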

[Plot: incident + reflected wave]

Figure 19.7 Nodes and anti-nodes on a string with A = 0.5 and k = 1 or
$\lambda = 2\pi$. The incoming and reflected waves form a standing wave, with nodes at
$0, -\pi, -2\pi, \ldots$ and anti-nodes at $-\frac{\pi}{2}, -\frac{3\pi}{2}, \ldots$. The vibrating string is a blur,
shown by the shaded region.

where I have used $\omega = vk$ and where $f_1$ is called the fundamental frequency.
The displacement of the string in a mode of vibration labeled by $n$ is


$$y_n(x,t) = A\sin\left[\frac{n\pi x}{L}\right]\sin\left[\frac{n\pi v t}{L}\right] \tag{19.39}$$


which I got from Eqn. 19.34 using $\omega_n = k_n v$. The first three modes are
shown in Figure 19.8. 

A string clamped at two ends can only vibrate in either the fundamental
frequency $\omega_1 = \frac{\pi v}{L}$ or an integer multiple of it, where the integer
gives the number of anti-nodes over the length of the string. 
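
As a quick numerical illustration of Eqns. 19.36 to 19.39, the following Python sketch lists the allowed frequencies of a clamped string and checks that each mode shape vanishes at both ends. The wave speed and string length are hypothetical values, and the function names are mine, chosen only for this example.

```python
import math

def string_mode_frequency(n, wave_speed, length):
    """Eqn. 19.38: f_n = n v / (2 L) for a string clamped at both ends."""
    return n * wave_speed / (2.0 * length)

def mode_shape(n, x, length, amplitude=1.0):
    """Spatial part of Eqn. 19.39: A sin(n pi x / L)."""
    return amplitude * math.sin(n * math.pi * x / length)

# Hypothetical string: wave speed 200 m/s, length 0.5 m.
WAVE_SPEED, LENGTH = 200.0, 0.5

frequencies = [string_mode_frequency(n, WAVE_SPEED, LENGTH) for n in (1, 2, 3)]
print("first three frequencies (Hz):", frequencies)

for n in (1, 2, 3):
    # Both clamped ends must be nodes in every mode.
    print(n, mode_shape(n, 0.0, LENGTH), mode_shape(n, LENGTH, LENGTH))
```

For these numbers the fundamental is 200 Hz and the next two modes are its second and third multiples; the end-point displacements should be zero up to rounding.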

Now, this reasoning can be bodily lifted for the following problem. 
Take a tube of length L, blocked by walls at the two ends. The longitudinal 
sound wave can go back and forth and form standing waves. The equation 
obeyed is still the wave equation, and the two ends are again nodes because 
the walls will not permit any longitudinal vibration there. We can use the 
same picture as in Figure 19.8 and the frequencies $\omega_n$ will be given by the
same formula. The only subtlety is that now the graphs in Figure 19.8 do
not represent transverse motion of the molecules, but instead they measure
the longitudinal displacement from equilibrium. If the height of the graph
is 2 mm somewhere, it means the molecules there are moving back and forth
(and not up and down) by 2 mm. Out of habit, we plot everything on the
y-axis, but that doesn’t mean the motion is perpendicular to the tube. 


Figure 19.8 The plots of $A\sin\left[\frac{n\pi x}{L}\right]$ for n = 1, 2, 3 and $L = \pi$. The mode with n
half-wavelengths has a frequency $\omega_n = \frac{n\pi v}{L}$.

Figure 19.9 From the top: First two modes of vibration of a tube of length $\pi$
closed at the right end and open at the left end followed by the first mode of a
tube of length $\pi$ open at both ends. You can read off the $\lambda$’s by inspection and
deduce the frequencies from $v = \lambda f$.


Here is a challenge. I take a tube, closed at one end and open at the 
other, like a Pepsi bottle, and blow into it. I hear a whistle at a certain
frequency. What determines that? Because I’m making the noise at the open
end, it is an anti-node. That is where the amplitude will be the biggest, and 
of course there is a node at the closed end where longitudinal motion is 
impossible. So, what pattern can I draw? I want maximum vibration at the 
open end and nothing at the other end. The lowest one I can draw looks 
like the top part of Figure 19.9. Because we see a quarter of a cycle on the 
interval of length L, we deduce $\lambda = 4L$ and the frequency $f = \frac{v}{4L}$. The
middle picture corresponds to $f = \frac{3v}{4L}$. When one end is open and one end
is closed, the frequencies are odd multiples of the fundamental frequency
$f = \frac{v}{4L}$.

Finally, consider a tube open at both ends, with anti-nodes at the 
ends. The vibration is depicted in the lowest part of Figure 19.9. Clearly 
$\lambda = 2L$ and $f = \frac{v}{2L}$. And again, you will find here all frequencies are integer
multiples of the fundamental. So, the story is this: as long as the two ends 
are both open or both closed (both nodes or anti-nodes), all frequencies 
are integer multiples of the fundamental one. If one end is open and the 
other is closed, you'll get odd multiples of the fundamental. 
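
The rule just stated is easy to tabulate. Here is a minimal Python sketch that lists the first few frequencies of a tube for the three kinds of ends. The speed of sound (343 m/s) and the tube length are assumptions made for illustration, and the `ends` labels are a convention invented for this example.

```python
def tube_frequencies(length, sound_speed, ends, count=3):
    """First `count` standing-wave frequencies of an air column.

    ends = "open-open" or "closed-closed": f_n = n v / (2 L), n = 1, 2, 3, ...
    ends = "open-closed":                  f_n = (2n - 1) v / (4 L), odd multiples only.
    """
    if ends in ("open-open", "closed-closed"):
        return [n * sound_speed / (2.0 * length) for n in range(1, count + 1)]
    if ends == "open-closed":
        return [(2 * n - 1) * sound_speed / (4.0 * length) for n in range(1, count + 1)]
    raise ValueError("ends must be 'open-open', 'closed-closed', or 'open-closed'")

# Assumed values: sound speed 343 m/s, tube length 0.25 m.
SOUND_SPEED, TUBE_LENGTH = 343.0, 0.25

bottle = tube_frequencies(TUBE_LENGTH, SOUND_SPEED, "open-closed")
pipe = tube_frequencies(TUBE_LENGTH, SOUND_SPEED, "open-open")
print("one end closed:", bottle)
print("both ends open:", pipe)
```

For the same length, the half-closed tube has a fundamental half that of the open tube, and its higher modes skip the even multiples.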

A challenge: How does a trumpeter vary the pitch by varying the 
length of the vibrating air column or a violinist by moving her fingers? 


CHAPTER 20 


Fluids 


20.1 Introduction to fluid dynamics and statics 


This is a relatively simple topic. If you took any kind of high school physics, 
you would have seen fluids. Whenever I say fluid, you are free to imagine 
water or oil. 


20.1.1 Density and pressure 


Let us begin with a basic property of the fluid, the density, denoted by $\rho$.
The density of water is $\rho_w = 1{,}000\ \mathrm{kg/m^3}$. The more subtle concept is the
one of pressure. If you dive down to the bottom of a swimming pool, the 
pressure goes up. What is the formal definition of pressure? That’s what I 
want to explain. 

If we pick a point in the fluid and say the pressure there is such and 
such, we mean the following. Say you get into that fluid and you want to 
carve out a little space for yourself, maybe a glass cube, and you want to 
live inside that cube. The water is trying to push you in from all sides and 
compress this cube. You therefore have to push out on all the walls. If the 
force you exert on a wall is some F and the area of that wall is A, that ratio 
is the pressure. The pressure will not depend on which wall you choose, 
provided the cube you are in is infinitesimal. The pressure is an intensive 
measure of how hard the water is trying to push in. Even if you don't insert 
the cube, that pressure is still there, but one way to measure the pressure 

Figure 20.1 A gas in a cylinder with a massless piston of area A on top. The
mass m exerts a force $F = mg$ and a pressure $P = \frac{mg}{A}$ Pascals.


is to try to go in there and push the fluid out and ask how hard it pushes 
back. The unit of pressure is $\mathrm{N/m^2}$ and is called a Pascal.

Here’s another example of pressure. You have a gas inside a cylinder 
as shown in Figure 20.1, with a massless piston at the top. The pressure 
of the gas and the pressure of the outside world are the same. But if you 
want to increase the pressure in the gas, you can put some extra weights 
on the piston. That mg will push down, and mg divided by the area of 
the piston A will be extra pressure $P = \frac{mg}{A}$ Pascals that you apply. That’s
also the pressure of the gas if there is no atmosphere outside. If there is
atmosphere outside, the total pressure is the atmospheric pressure plus $\frac{mg}{A}$.
The atmospheric pressure is everywhere. So, when you push down on the 
piston to compress the gas further, you are adding to the atmospheric pres- 
sure this extra force divided by area. The total is called the absolute pressure, 
and the extra bit due to the mg is called the gauge pressure. For example, 
when your car has a flat, the bright side of it is that the pressure inside 
the tire is not zero; it equals the atmospheric pressure. But that does not 
help you because the same pressure is also outside. When you stick a gauge 
in the valve stem and you measure something, say 32 psi, that’s the gauge 
pressure. 


20.1.2 Pressure as a function of depth 


Pressure is a condition in a fluid. One important property of pressure in 
a fluid is that all points at a given depth have the same pressure. We un- 
derstand that as follows. I imagine a little horizontal cylindrical section of 
the same fluid, shown by dotted lines in the lower part of the container in 
Figure 20.2. Remember this is not a real cylinder. This is the same fluid, 

Figure 20.2 The horizontal dotted cylinder marks a part of the fluid in
equilibrium under forces from the left L and right R. The vertical dotted cylinder
marks a part of the fluid in equilibrium under forces from the top T and
bottom B and its weight Mg.


and I have mentally isolated a part of the fluid that looks like a cylinder; 
this mental image is represented in dotted lines. 

Can the pressure on the two sides—L and R—be different? No, be- 
cause if the pressure on the left were bigger than the pressure on the right, 
the pressure times area on the left would exceed the pressure times area on 
the right, and the fluid would move to the right. But it’s not doing any- 
thing. It’s in equilibrium, and the only way that can happen is if it is being 
pushed equally from both sides. So, the pressure cannot change at a given 
depth. 

Now let’s take a cylinder that is vertical, with the top and bottom 
labeled T and B. It has a base area A and height h. The argument I just 
gave to explain why the pressures at L and R in the horizontal cylinder had 
to be equal will now tell you that the pressures at the top T and bottom 
B cannot be equal. If they were equal, acting on equal areas, they would 
produce a net upward force of zero, and there would be nothing to keep 
the cylinder of water from falling down. So there must be a net upward 
force to equal the weight of that cylinder of the fluid. We are now going to 
calculate the pressure difference using this notion. 

Let’s call the pressures $P_1$ and $P_2$ at depths $h_1$ and $h_2$ and equate the
net upward force to the weight of the marked body of fluid, which is g
times its mass $M = \rho A h$:


$$A(P_2 - P_1) = Mg = \rho A h g = \rho A g(h_2 - h_1), \tag{20.1}$$

which may be rewritten as

$$P_2 = P_1 + \rho g(h_2 - h_1). \tag{20.2}$$


The area A cancels out, as it should, because the pressure difference cannot 
depend on what body of water I choose to focus on. 

How about forces on the sides of the cylinder? They cancel at every 
height, because at every height the push from the left and right is equal. 

It’s standard to take 1 to be the point right at the surface (so that
$h_1 = 0$), to take point 2 to be any point inside, and to call the depth of
that point simply $h$. Thus $h$ increases as we move down the fluid. The
pressure at any point in the fluid is the pressure at the top, which is usually
atmospheric pressure $P_A$, plus $\rho g h$:


$$P = P_A + \rho g h. \tag{20.3}$$


The pressure at the surface of the lake is due to the atmosphere. If you dive 
to some depth $h$, the pressure goes up by $\rho g h$. If you go to the bottom of
the ocean, there will be an incredible amount of pressure outside, while
inside there is just the atmospheric pressure in your lungs. That’s why the
human body cannot survive at the bottom of the ocean. That’s why a sub- 
marine must be engineered to withstand the pressure difference. But fish 
don’t have that problem, because fish are breathing the water. The water 
is going into their system and pushing out with the same pressure as the 
water outside. 

Now, how about atmospheric pressure? The atmospheric pressure 
arises because we are ourselves living in the bottom of a pool, but a pool 
filled with air. The density of the air above our heads decreases with
altitude and essentially vanishes beyond a height $h \simeq 10^5$ m. The atmospheric
pressure at the bottom of this pool, due to the entire air column above,
is $P_A = 10^5$ Pascals. So, we are living in the bottom of a pool where the
pressure is $10^5$ Pascals, relative to interstellar space. That pressure doesn’t
kill us because it acts both from the outside pushing in and from inside, 
pushing out. But you must have seen the dramatic experiment where a can 
is heated until some of the air expands and escapes, and then the can is 
sealed and cooled: the resultant drop in pressure is big enough to cause the 
can to implode. 

Now, ask yourself the following question: If our atmosphere, instead 
of being a column of air, was a column of water, how high would it be to 
exert the same pressure? The answer is 

$$10^5\ \mathrm{Pa} = \rho_w g h. \tag{20.4}$$


The density of water is $\rho_w = 10^3\ \mathrm{kg/m^3}$, and let’s pretend $g = 10\ \mathrm{m\,s^{-2}}$.
Cancel all the powers of 10, to get $h \simeq 10$ m, or roughly 32 feet.
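
The same arithmetic can be written as a short Python sketch, using the rounded values from the text ($P_A = 10^5$ Pa, $\rho_w = 10^3\ \mathrm{kg/m^3}$, $g = 10\ \mathrm{m\,s^{-2}}$). The function names are illustrative, and the conversion factor 0.3048 m per foot is the only number not taken from the text.

```python
def pressure_at_depth(depth, density, surface_pressure, g=10.0):
    """Eqn. 20.3: P = P_A + rho g h, with everything in SI units."""
    return surface_pressure + density * g * depth

def column_height(pressure, density, g=10.0):
    """Height of a fluid column that exerts `pressure` at its base (Eqn. 20.4)."""
    return pressure / (density * g)

P_ATM = 1.0e5            # Pa, rounded atmospheric pressure used in the text
RHO_WATER = 1.0e3        # kg/m^3
METERS_PER_FOOT = 0.3048

water_height = column_height(P_ATM, RHO_WATER)
print("water column height (m):", water_height)
print("water column height (ft):", water_height / METERS_PER_FOOT)

# Diving 10 m down therefore adds one more atmosphere to the pressure.
print("pressure 10 m under water (Pa):", pressure_at_depth(10.0, RHO_WATER, P_ATM))
```

So every 10 m of water adds roughly one atmosphere to the pressure at the surface.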

Now we are going to put the formula $P = P_A + \rho g h$ to work. The first
thing you can do is to build yourself a barometer. The barometer tells you
what the pressure is right now. When I said that the atmospheric pressure
is $10^5$ Pa, I was using average pressure. Atmospheric pressure doesn’t
really stay locked at that value. Each day there are fluctuations. That’s why
the weather report tells you the pressure is going up or the pressure is go- 
ing down. Here is one way to measure atmospheric pressure. You take a 
container, fill it with some liquid, take a test tube, evacuate it completely, 
and stick it head first into the fluid as shown in Figure 20.3. There is a 
complete vacuum at the top of the tube. The atmosphere is pushing down 
on the surface (point X), so the fluid will rise up to some height h. What's 
going to be the height? It will be such that the pressure at the point Y, level 
with the fluid outside, equals the pressure at the surface of the fluid outside 
the tube (point X). We are simply equating the pressure at two points X 
and Y, which are at the same height. 

Now X is at the atmospheric pressure $P_A$, which we are trying to
measure. At Y the pressure is the zero pressure at the top of the tube, plus the
$\rho g h$ of the fluid. Thus


$$P_A = 0 + \rho g h. \tag{20.5}$$


Figure 20.3 A barometer. The pressure at X, the atmospheric pressure $P_A$,
equals the pressure at Y, which equals 0 from the evacuated region, plus $\rho g h$ of
the vertical column.

So, if you build this gadget, this barometer out of water, the water column
will rise to the height of roughly 32 feet. But nobody wants a gadget 32 feet
high, so what you use instead is mercury, because it’s very dense. You want
to get the same atmospheric pressure, but you want to have a bigger $\rho$
and a smaller $h$, which is close to 760 millimeters. That’s why the weather
guy says that the pressure today is so many millimeters, that the mercury 
is dropping, and so forth. Now, I’m not sure why they bother to give the 
numbers, because for most of us, those numbers don’t mean anything. 
Here’s a number: 746 millimeters. Does it speak to you? Not to me. It’s not 
like saying that the temperature today is 67 degrees Fahrenheit, for which 
I have a real feeling. So the weather report on millimeters of mercury goes 
through me like a beam of neutrinos. You can use any fluid you like, but 
you have to remember your choice. Thus, if the reading is 760 millimeters, 
be sure it is mercury, because if it’s water the person is talking about, you 
are in serious trouble. 
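
To see where 760 millimeters comes from, and to turn a weather-report reading into Pascals, here is a minimal Python sketch of Eqn. 20.5. The density of mercury (about 13,595 kg/m³), the value of g, and the standard atmosphere of 101,325 Pa are not given in the text; they are assumed reference values, and the function names are illustrative.

```python
RHO_MERCURY = 13595.0    # kg/m^3, assumed reference density of mercury
RHO_WATER = 1000.0       # kg/m^3
G = 9.80665              # m/s^2, assumed standard gravity

def barometer_height(pressure, density, g=G):
    """Eqn. 20.5 solved for h: h = P_A / (rho g), in meters."""
    return pressure / (density * g)

def pressure_from_mercury_mm(reading_mm, g=G):
    """Convert a barometer reading in millimeters of mercury to Pascals."""
    return RHO_MERCURY * g * (reading_mm / 1000.0)

STANDARD_ATMOSPHERE = 101325.0   # Pa, assumed reference value

print("mercury column (m):", barometer_height(STANDARD_ATMOSPHERE, RHO_MERCURY))
print("water column (m):", barometer_height(STANDARD_ATMOSPHERE, RHO_WATER))
print("746 mm of mercury in Pa:", pressure_from_mercury_mm(746.0))
```

The mercury column comes out close to 0.76 m while the water column is a bit over 10 m, which is the whole reason for choosing mercury.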

Imagine you are trying to drink from a straw. Now, remember the 
fluid is now water. It’s not mercury now, because you're doing a different 
experiment. You know that when you drink from a straw you create a par- 
tial vacuum in your mouth, like the top of the evacuated test tube in the 
figure, except the pressure is not zero in your mouth—it’s just lower than 
the atmosphere. So, you have to reduce the pressure more and more in 
your mouth until this fluid can climb up to your mouth. If you just want 
the water to make it to your mouth, the reduced pressure you need in your 
mouth depends on the height of the straw. If you want to drink water from 
a well that is more than 32 feet deep, you are out of luck. Even if your 
head is a complete vacuum, you cannot get the water to climb more than 
32 feet. 

Next, I give you a fluid that doesn’t mix with water and I tell you to 
find its density. There are many ways to do this. One is to find the mass 
and volume of that fluid and divide. But here is another option. You take 
what’s called a U-tube, shown in Figure 20.4. No, that’s not where they post 
all the embarrassing videos, or the lectures on which this book is based; it 
is a physics contribution to pop culture that somehow did not do as well. 
Let’s say you fill it up with one fluid in one leg and the other fluid in the 
other: oil on the left and water on the right. The figure is supposed to 
tell you right away that oil is less dense than water. We can quantify that 
by equating the pressure at two points at the same depth, say along the 
dark dotted line. The pressures must be equal at all points on that line: if 

Figure 20.4 A U-tube, used to find the density of a fluid relative to another with
which it does not mix, such as oil (left) and water (right). Equating the pressures
at two points that lie at the same height in the liquid (on the dark dotted line),
one deduces $\rho_1 g h_1 + \rho_2 g\Delta = \rho_2 g h_2 + \rho_2 g\Delta$. One can use the light dotted line as
well for comparison by subtracting $\rho_2 g\Delta$ from both sides.


we isolate an imaginary horizontal cylinder there, it should not be pushed 
sideways. So we conclude 


$$P_A + \rho_1 g h_1 + \rho_2 g\Delta = P_A + \rho_2 g(h_2 + \Delta). \tag{20.6}$$


Canceling $P_A$ and $\rho_2 g\Delta$ from both sides, we find the desired relation
between the densities


$$\frac{h_1}{h_2} = \frac{\rho_2}{\rho_1}. \tag{20.7}$$

If you know the density of one fluid, you can find the density of the other. 
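
Here is a minimal Python sketch of the U-tube measurement, based on Eqns. 20.6 and 20.7. The column heights, the depth $\Delta$, and the function names are hypothetical values chosen for illustration; only the density of water comes from the text.

```python
def density_from_u_tube(h_unknown, h_known, rho_known):
    """Eqn. 20.7 rearranged: rho_1 = rho_2 * h_2 / h_1."""
    return rho_known * h_known / h_unknown

def left_pressure(p_atm, rho_1, h_1, rho_2, delta, g=10.0):
    """Left side of Eqn. 20.6: atmosphere + oil column + water below it."""
    return p_atm + rho_1 * g * h_1 + rho_2 * g * delta

def right_pressure(p_atm, rho_2, h_2, delta, g=10.0):
    """Right side of Eqn. 20.6: atmosphere + water column of height h_2 + delta."""
    return p_atm + rho_2 * g * (h_2 + delta)

RHO_WATER = 1000.0   # kg/m^3
P_ATM = 1.0e5        # Pa

# Hypothetical readings: oil column 10 cm, water column 9 cm, delta 5 cm.
h_oil, h_water, delta = 0.10, 0.09, 0.05

rho_oil = density_from_u_tube(h_oil, h_water, RHO_WATER)
print("deduced oil density (kg/m^3):", rho_oil)

# With this density the pressures on the dark dotted line should match.
print(left_pressure(P_ATM, rho_oil, h_oil, RHO_WATER, delta))
print(right_pressure(P_ATM, RHO_WATER, h_water, delta))
```

The taller column belongs to the less dense fluid, so these readings give an oil density of about 900 kg/m³.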

Notice that even though you cannot connect points lying on the light 
dotted line with a cylinder of water, you can equate pressures there by
starting at the dark dotted line and working your way up the same fluid. But
you cannot compare the pressures at two points on a horizontal line above 
the light dotted line, say a line that passes the top of the water column, 
because it does not pass through the same fluid. 
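
As a rough numerical sketch of Eqn. 20.7, the snippet below finds the density of the oil from the two column heights. It assumes $h_1$ and $h_2$ are the heights of the oil and water columns above the light dotted line; the function name `density_from_u_tube` and the sample heights are made up for illustration and do not come from the figure.

```python
def density_from_u_tube(h_unknown, h_known, rho_known):
    """Density of the unknown fluid from rho_unknown * h_unknown = rho_known * h_known.

    Heights are measured above the level where the two fluids meet.
    """
    if h_unknown <= 0 or h_known <= 0:
        raise ValueError("column heights must be positive")
    return rho_known * h_known / h_unknown


RHO_WATER = 1000.0  # kg/m^3

# Hypothetical reading: 12.5 cm of oil balances 10.0 cm of water.
rho_oil = density_from_u_tube(h_unknown=0.125, h_known=0.100, rho_known=RHO_WATER)
print(f"oil density is about {rho_oil:.0f} kg/m^3")
```

The taller column belongs to the lighter fluid, which is what the figure is meant to show at a glance.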


20.2 The hydraulic press 


Now for yet another application of the fact that the pressure in a fluid
is equal at two points at the same height: the hydraulic press. Figure 20.5
depicts two pistons of different areas $A_1$ and $A_2$ connected as shown and
filled with a body of some incompressible fluid, which means its volume
cannot be changed by changing pressure. Now, no liquid is really
incompressible, but close approximations exist, like water. On the left piston
will be some object I want to lift by applying a force on the right piston.

Figure 20.5 A hydraulic press that amplifies the force is made of two cylinders
of different cross-sectional areas $A_1$ and $A_2$, connected by a horizontal segment.
Given $P_1 = F_1/A_1 = F_2/A_2 = P_2$, the equality of the pressures, and $A_1 x_1 = A_2 x_2$ (the
equality of volumes displaced in the two sides), we find $F_1/F_2 = A_1/A_2 = x_2/x_1$. A small
force $F_2$ acting on the right cylinder pushes the incompressible fluid by a distance
$x_2$, and this leads to a large force $F_1$ on the left that moves the piston over a smaller
distance $x_1$. Energy is conserved because $F_1 x_1 = F_2 x_2$, as in a lever.

When I push down on the right piston with a force $F_2$, what happens
at the other side? If these pistons are at the same height, we know that
$P_1 = P_2$. However, this equality is essentially true even if the pistons move
a bit at the two ends, because most of the pressure is due to the forces
on the pistons and not the $\rho g h$ contribution of the fluid. On equating the
pressures

$$\frac{F_1}{A_1} = \frac{F_2}{A_2}, \quad \text{which means} \quad F_1 = F_2 \frac{A_1}{A_2}. \qquad (20.8)$$

Suppose we have an elephant at the left of weight $F_1$, and I want to lift
it by applying a force $F_2$ at the right. If $A_1/A_2 = 1000$ and I apply one newton at
the right, I'll lift a 1,000-N elephant on the other side.

But this is not the oldest trick in the book. An even older one,
invented by cave people, is that if you have a lever you can amplify the force.
But you must know from that example that you don’t get something for
nothing. In other words, when I lift the elephant, the fact that $F_1 \neq F_2$
is perfectly okay. But the work I do at the right must be the work delivered
at the other side; that is the law of conservation of energy. The work I do
is the force multiplied by the distance, and the force in turn is the pressure
times area. So we want to confirm that

$$F_1 x_1 = F_2 x_2 \quad \text{or} \quad P_1 A_1 x_1 = P_2 A_2 x_2. \qquad (20.9)$$

Because $P_1 = P_2$, we require that $A_1 x_1 = A_2 x_2$, which merely equates the
volume displaced at the two sides. This means the total volume of the fluid
is unchanged, which in turn is assured by incompressibility. I cannot get
more joules out than in, using any device. But the press is still useful,
because I may be willing to move a whole meter at the right to lift the elephant
by one millimeter.
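
Here is a small sketch of the bookkeeping for the press, assuming an ideal incompressible fluid and pistons at the same height. The function name `hydraulic_press` and the piston areas are illustrative choices; the one newton and the one meter of travel are the numbers used above.

```python
def hydraulic_press(force_small, area_small, area_large, travel_small):
    """Return (force_large, travel_large) for an ideal hydraulic press."""
    pressure = force_small / area_small          # the same pressure acts on both pistons
    force_large = pressure * area_large          # Eqn. 20.8
    travel_large = travel_small * area_small / area_large  # equal volumes displaced
    return force_large, travel_large


# Area ratio of 1000, one newton applied, piston on the right pushed a whole meter.
force_small, travel_small = 1.0, 1.0
force_large, travel_large = hydraulic_press(
    force_small, area_small=1e-3, area_large=1.0, travel_small=travel_small
)

work_in = force_small * travel_small
work_out = force_large * travel_large
print(force_large, travel_large, work_in, work_out)
```

The force goes up by the area ratio and the travel goes down by the same ratio, so the two works agree, which is the content of Eqn. 20.9.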

This is also how the brake in your car works. You pump the brake 
pedal, which pushes on a narrow cylinder filled with brake fluid. You push 
it quite a bit, several centimeters at your end; at the other end is a broad 
cylinder whose piston pushes on the drum, retarding the rotating wheel. 
It moves a very tiny amount but exerts an enormous amount of force. The 
brake fluid has the same pressure at both places, but the force you apply 
with your foot is much, much smaller than the force that the drum will 
exert on the rotating wheel. (In practice, this effect is compounded by an 
intervening lever that amplifies the force exerted by your foot.) 


20.3 Archimedes’ principle 


When Archimedes was taking a bath, he noticed that something immersed 
in a fluid seems to weigh less. Imagine that you have attached an object 
to some kind of a spring balance and weighed it so that the $kx$ of the
spring balanced the $mg$ of the object. If you do the same thing again with the
object now immersed in a fluid, you will find it seems to weigh less. And 
the question is, “How much less?” Archimedes’ answer is very simple. The 
weight loss equals the weight of fluid displaced. Now, how do you show that? 
One way, which I like, is as follows. Look at Figure 20.6, which shows an 
irregular object, say a stone, of weight Mg suspended in a fluid. Now, if 
the thing hanging here is itself a chunk of the same fluid shaped like that 
stone, you will not have to do anything, because that chunk of fluid can 
float at that height for free. But if you now take that chunk of fluid out and
put a stone there of the same shape, the rest of the fluid doesn’t know what
you’re doing. It applies the same force that it would to support its own
kin. Namely, the rest of the fluid is in a configuration ready to support that
amount of fluid. So, if you take that fluid out and put in something else
with the same shape, the fluid will apply the same amount of force. The
rest of the weight is of course your problem. You provide the remaining
force, the reduced weight.

Figure 20.6 The irregular object of mass $M$ displaces a volume of liquid equal
to its own. The loss in its weight is the weight of that body of liquid. This is more
readily seen for a regular object like the cylinder shown to the right. At the top
left is a boat whose weight equals the weight of the liquid displaced by the
fraction immersed.

Now, one formal way to prove Archimedes’ principle is to consider
a cylindrical object of area $A$ and height $h$ as shown in Figure 20.6. What
is the net force of buoyancy? It’s $P_B$, the pressure at the bottom, times the
area, minus $P_A$, the pressure at the top, times the area. We have already seen
the difference in pressure is $P_B - P_A = \rho_l g h$, where $\rho_l$ is the density of the
liquid. The net force of buoyancy is then

$$F_{\text{buoyancy}} = \rho_l h A g, \qquad (20.10)$$

which is the weight of the liquid displaced. You can extend this result to
irregular shapes by imagining them to be made up of many arbitrarily thin
cylinders of various lengths glued together.

So, basically the body weighs less in water because the lower part of 
the body is being pushed up harder than the upper part of the body is being 
pushed down, which happens because the pressure increases with depth. 
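
To see the cylinder argument in numbers, the sketch below computes the buoyant force from the pressures on the top and bottom faces and compares it with the weight of the displaced liquid. The function names, the cylinder dimensions, and the two depths are assumptions made for illustration only.

```python
G = 9.8  # m/s^2


def buoyant_force_on_cylinder(rho_liquid, area, height, depth_of_top, p_surface=0.0):
    """Net upward force on an upright cylinder: (P_bottom - P_top) * area."""
    p_top = p_surface + rho_liquid * G * depth_of_top
    p_bottom = p_surface + rho_liquid * G * (depth_of_top + height)
    return (p_bottom - p_top) * area


def weight_of_displaced_liquid(rho_liquid, area, height):
    """Weight of the liquid that would fill the cylinder's volume."""
    return rho_liquid * area * height * G


# A hypothetical cylinder of area 0.01 m^2 and height 0.5 m in water.
f_shallow = buoyant_force_on_cylinder(1000.0, 0.01, 0.5, depth_of_top=0.1)
f_deep = buoyant_force_on_cylinder(1000.0, 0.01, 0.5, depth_of_top=10.0)
w_displaced = weight_of_displaced_liquid(1000.0, 0.01, 0.5)
print(f_shallow, f_deep, w_displaced)
```

In this model the force does not depend on how deep the cylinder sits, because only the pressure difference between its two faces matters.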

Figure 20.6 assumes the objects immersed are made of materials like 
iron. The weight of that chunk of iron will be more than the weight of 
the water displaced, and you will have to provide the difference to hold it
up. But suppose the object was not made of iron, but made of cork. If it’s 
cork, it won’t want to be there, under water. Right? Because then the force 
applied by the water is more than the weight it takes to support it. So, the 
cork will bob up to the surface. The same goes for the human body, as the 
Mafia has figured out: if you want the body to stay under water you will 
have to add weights to it in the form of concrete shoes. Or, closer to home, 
it’s like a rubber ducky in your bath. If you want to keep the rubber ducky 
under the water level, you have to hold it down. You relax for a moment, 
and it will bob right up to the surface. 

Suppose you do not hold it down. We know some fraction f of the 
volume V of the object O (for example, the rubber ducky) is going to be 
inside the water and the rest outside. You can already guess what f is, but 
let’s prove it. Equating the weight of the water displaced to the weight of 
the object gives us 


$$\rho_w (fV) g = \rho_O V g \quad \text{or} \quad f = \frac{\rho_O}{\rho_w}. \qquad (20.11)$$

In other words, if the floating material has 90% the density of water, 
90% of it will be submerged. That’s exactly what happens with ice, which 
has a smaller density than water. Normally, when you cool something it 
decreases in volume and the density goes up, but water actually expands 
slightly when you cool it below 4°C. That’s why ice floats on water and 
why icebergs are mostly under water, and why we have movies like Titanic. 
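
A quick check of Eqn. 20.11, as a sketch: the density of ice used here (about 917 kg/m^3) is a typical handbook value rather than a number from the text, and the function name `submerged_fraction` is made up for illustration.

```python
def submerged_fraction(rho_object, rho_fluid):
    """Fraction f of a floating object's volume that lies below the surface."""
    if rho_object > rho_fluid:
        raise ValueError("object is denser than the fluid and will not float")
    return rho_object / rho_fluid


RHO_WATER = 1000.0  # kg/m^3
RHO_ICE = 917.0     # kg/m^3, approximate

f_ice = submerged_fraction(RHO_ICE, RHO_WATER)
print(f"about {100 * f_ice:.0f}% of the ice is under water")
```
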
Archimedes’ principle has many applications. Say we want to build a 
boat of steel. Now, please don’t say, “How do you make a steel boat float 
in water?” It’s not a solid steel boat. If you are thinking about a solid steel 
boat, you should get into another line of work. The boat is made out of 
steel, but it’s completely hollow. Look at the floating boat in Figure 20.6. 
You can easily calculate how deep this one should sink to balance its weight. 
If you give me the weight $Mg$ of the boat and the area $A$ of the base, I’ll tell
you it will sink to depth $H$ such that


$$\rho_w (HA) g = Mg. \qquad (20.12)$$


Then, of course, you can load some cargo, and the boat will sink even more 
to displace enough water to equal its weight plus that of the cargo. How 
much cargo can the boat take? With the maximum weight of the cargo, the 
boat is infinitesimally away from being fully immersed, that is, the volume
of water displaced equals that of the hollow boat. 
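
The boat calculation can be sketched as follows, assuming a box-shaped hull with vertical sides so that the displaced volume is just base area times depth. The function names and the boat's mass and dimensions are hypothetical.

```python
RHO_WATER = 1000.0  # kg/m^3


def draft(mass, base_area, rho=RHO_WATER):
    """Depth H to which the boat sinks, from rho * (H * A) * g = M * g."""
    return mass / (rho * base_area)


def max_cargo_mass(boat_mass, hull_volume, rho=RHO_WATER):
    """Cargo that brings the boat to the point of being fully immersed."""
    return rho * hull_volume - boat_mass


# A hypothetical 2000-kg boat with a 10 m^2 base and sides 1 m high.
empty_draft = draft(mass=2000.0, base_area=10.0)
cargo_limit = max_cargo_mass(boat_mass=2000.0, hull_volume=10.0 * 1.0)
print(empty_draft, cargo_limit)
```

Note that $g$ cancels from both sides of Eqn. 20.12, so only masses and densities appear.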


20.4 Bernoulli’s equation 


This is the first time we will consider fluids in motion. Once again I will 
only invoke F = ma, but we have to be clever, as when we mentally isolated 
a chunk of water and demanded that it be in equilibrium to find out how 
the pressure varied with depth. 

Figure 20.7 is the standard picture in all the textbooks. This is the best
we have been able to come up with after three hundred years. The
incompressible fluid of density $\rho$ is now flowing in a pipe. We pick two points
along the flow labeled 1 and 2, and we focus on the fluid in between. The
cross section of the pipe is changing, with values $A_1$ and $A_2$; the velocity
has values $v_1$ and $v_2$; the altitude (from some reference height) has values
$h_1$ and $h_2$; and finally the changing pressure has values $P_1$ and $P_2$.
Remember that in the following discussion $h_1$ and $h_2$ are heights measured upward
from some reference level, in contrast to the $h$ used earlier (in formulas like
$P = P_A + \rho g h$), which stood for the depth measured downward from the
surface.

We are going to find a relation between the above-mentioned quan- 
tities at two locations 1 and 2 using the law of conservation of energy. 


20.4.1 Continuity equation 


First a purely kinematical result. If the fluid is incompressible it obeys the 
continuity equation, which says that the rate of volume flow in at 1 equals 
the rate of flow out at 2. What comes in has to go out of this fixed volume 
(between points 1 and 2) because the density cannot change. How much 
water do you think comes in through the pipe at point 1 in a time $dt$? Can
you visualize in your mind that, in time $dt$, the fluid moves a distance $dx_1 = v_1 dt$
at the left end and therefore a volume $A_1 v_1 dt$ enters at the left end, and
likewise at 2 a volume $A_2 v_2 dt$ exits? Equating the input and output rates
and canceling $dt$ we find

$$A_1 v_1 = A_2 v_2, \qquad (20.13)$$

which is called the continuity equation.

Figure 20.7 The flow of a fluid along a pipe of varying cross section, pressure,
and height. We focus on the part initially between cross-sectional areas $A_1$ and
$A_2$. In a time $dt$, it is left unchanged but for a sliver of volume $A_1 dx_1 = A_1 v_1 dt$,
which has been removed from the left and appended to the right end as a sliver of
equal volume $A_2 dx_2 = A_2 v_2 dt$.


Think of cars going down a freeway as the freeway gets narrow. Unlike 
in real life, let us not allow the cars to pile up. We want the density of cars 
to be the same. That means if there’s a narrow section of the road, the 
cars have to go faster to maintain the traffic flow. It follows that if I go 
to one checkpoint and see how many cars pass me per second there, the 
same number will cross anywhere else. So the speeds will be in inverse 
proportion to the width of the road (or area in the case of the pipe) so that 
the product remains the same. 
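
The continuity equation is simple enough to try with numbers. In this sketch the function name `speed_after_constriction` and the pipe areas and speed are made-up values for illustration.

```python
def speed_after_constriction(v1, area1, area2):
    """Speed v2 from the continuity equation A1 * v1 = A2 * v2."""
    if area1 <= 0 or area2 <= 0:
        raise ValueError("areas must be positive")
    return v1 * area1 / area2


# Water moving at 2 m/s enters a section with one third of the area.
v1, area1, area2 = 2.0, 0.03, 0.01
v2 = speed_after_constriction(v1, area1, area2)

flow_in = area1 * v1    # m^3/s entering at 1
flow_out = area2 * v2   # m^3/s leaving at 2
print(v2, flow_in, flow_out)
```
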

Now, we are going to find a constraint between the variables at 1 and 
2. Think about what is going to happen before you derive any formula. 
Does it make sense to you that, left to itself, the fluid will slow down on the 
way to the top, because it’s got to work against gravity? Therefore, there’s 
going to be some connection between the height and the velocity of the 
fluid, just from the law of conservation of energy. But you must remember 
that if there are external forces on a system, E = K + U is not fixed but 
changes by an amount equal to the work done by external force, which 
in our case is exerted by the rest of the fluid on either side of the chosen 
volume: 

$$E_{\text{final}} - E_{\text{initial}} = \text{Work done by external forces}. \qquad (20.14)$$


We are going to apply this to the fluid between these two cross sections
$A_1$ and $A_2$. Imagine it is colored differently from the rest. At every point
in the region between 1 and 2 the fluid has a certain velocity and height
and thus a certain potential and kinetic energy at time $t = 0$. A little later
this body of colored fluid that we follow mentally has moved a bit to the
right. If we compare the “before” to the “after,” most of the fluid is doing
the same thing except that a sliver of width $dx_1 = v_1 dt$ and mass $\rho A_1 v_1 dt$
is missing at the left and has been appended to the right end as a sliver of
width $dx_2 = v_2 dt$ and mass $\rho A_2 v_2 dt$. The change in kinetic energy due to
this transfer is

$$dK = \frac{1}{2} A_2 v_2 dt\, \rho v_2^2 - \frac{1}{2} A_1 v_1 dt\, \rho v_1^2 \qquad (20.15)$$

and the change in potential energy is

$$dU = A_2 v_2 dt\, \rho g h_2 - A_1 v_1 dt\, \rho g h_1. \qquad (20.16)$$

The work done by the pressure at 1 is $F_1 dx_1 = +P_1 A_1 dx_1 = P_1 A_1 v_1 dt$
and the work at the other end is $-P_2 A_2 v_2 dt$, which is negative because the
displacement is opposite to the applied force. The work-energy theorem of
Eqn. 20.14 now says

$$\frac{1}{2} A_2 v_2 dt\, \rho v_2^2 + A_2 v_2 dt\, \rho g h_2 - \frac{1}{2} A_1 v_1 dt\, \rho v_1^2 - A_1 v_1 dt\, \rho g h_1 = A_1 v_1 dt\, P_1 - A_2 v_2 dt\, P_2. \qquad (20.17)$$

Canceling $A_1 v_1 dt = A_2 v_2 dt$ (continuity equation) from both sides we
arrive at

$$\frac{1}{2}\rho v_2^2 + \rho g h_2 - \frac{1}{2}\rho v_1^2 - \rho g h_1 = P_1 - P_2, \qquad (20.18)$$

which is rearranged to arrive at Bernoulli’s equation

$$P_1 + \frac{1}{2}\rho v_1^2 + \rho g h_1 = P_2 + \frac{1}{2}\rho v_2^2 + \rho g h_2. \qquad (20.19)$$

It is true there’s a pump somewhere pushing the fluid in the beginning,
but you don’t have to go all the way to the pump. In the end you only
ask what is in contact with the chosen segment of the fluid. It is the fluid to 
the left that is doing work on the segment and fluid to the right on which 
the segment is doing work. 

In a real fluid, the walls will in fact exert a force parallel to themselves. 
There will be a drag on the fluid because the fluid really doesn’t like to 
move right up against the walls. It will move more easily in the middle of 
the tube. Different parts of the fluid will be going at different speeds, and 
there will be a lot of dissipation due to this effect called viscosity. We’re
ignoring viscosity and all other losses in deriving Bernoulli’s equation.
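
As a sketch of how Bernoulli’s equation and the continuity equation work together, the snippet below finds the speed and pressure at point 2 from the conditions at point 1. It assumes an ideal fluid with no viscosity; the function name `downstream_state` and all the numbers are hypothetical.

```python
G = 9.8  # m/s^2


def downstream_state(p1, v1, h1, area1, area2, h2, rho):
    """Return (v2, p2) from continuity and Bernoulli's equation.

    Heights h1 and h2 are measured upward from a common reference level.
    """
    v2 = v1 * area1 / area2                                        # continuity
    p2 = p1 + 0.5 * rho * (v1**2 - v2**2) + rho * G * (h1 - h2)    # Bernoulli
    return v2, p2


# Water entering at 1 m/s and 200 kPa, rising 2 m into a pipe of half the area.
rho, p1, v1, h1 = 1000.0, 2.0e5, 1.0, 0.0
area1, area2, h2 = 0.02, 0.01, 2.0
v2, p2 = downstream_state(p1, v1, h1, area1, area2, h2, rho)

left = p1 + 0.5 * rho * v1**2 + rho * G * h1
right = p2 + 0.5 * rho * v2**2 + rho * G * h2
print(v2, p2, left, right)
```

The pressure drops at point 2 for two reasons here: the fluid has climbed, and it has sped up.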


20.5 Applications of Bernoulli’s equation 


Figure 20.8 shows a tank of water filled to a height $H$. Your goal is to punch
a hole on the side at a height $h$ so that the water that sprays out lands in
the dog’s bowl, a horizontal distance $d$ away. This is a two-part problem:
(i) find the velocity $v_2$ of the water as it emerges horizontally from the
hole; (ii) equate the distance it travels before hitting the ground to $d$.

For point 1 of Bernoulli’s equation choose a point on the surface of
the water in the tank, so that $h_1 = H$. Assuming the height drops very slowly
due to the leak, you may set $v_1 = 0$. The pressure $P_1 = P_A$, the atmospheric
pressure. For point 2, choose a point just outside the hole. Clearly $h_2 = h$, and
$P_2 = P_A$. We find the velocity $v_2$ using

$$P_A + \rho_w g H + \frac{1}{2}\rho_w \cdot 0^2 = P_A + \rho_w g h + \frac{1}{2}\rho_w v_2^2, \quad \text{which means} \qquad (20.20)$$
$$v_2^2 = 2g(H - h). \qquad (20.21)$$

Figure 20.8 Choose the height $h$ such that the dog gets a drink.


The water has the same speed it would have had if it had freely fallen verti- 
cally a distance H — h. However, the body of water coming out of the hole 
is not the water from the top. If you put in some coloring at the top and 
punch a hole, the water immediately coming out of the hole would not be 
colored. 

If the jet of water travels for a time $t$ before hitting the ground, then
from familiar kinematics,

$$h = \frac{1}{2} g t^2 \qquad (20.22)$$
$$d = v_2 t. \qquad (20.23)$$

Eliminating $t$ and using Eqn. 20.21 we find $d^2 = 4h(H - h)$, or

$$h^2 - hH + \frac{d^2}{4} = 0, \qquad (20.24)$$

which has two solutions spaced equally above and below the midpoint $H/2$:

$$h = \frac{H \pm \sqrt{H^2 - d^2}}{2}. \qquad (20.25)$$

If you choose the higher solution for $h$, the water will come out slowly
but have a large time-of-flight; if you choose the smaller one, the situation
will be reversed. Observe that the farthest the jet can go is $d = H$ (beyond
which the square root becomes imaginary), and for this you must punch
the hole at the halfway point $h = H/2$.
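
The two solutions of Eqn. 20.25 can be checked by firing the jet from each height and seeing where it lands. This is a sketch under the same assumptions as the text (no viscosity, slowly dropping water level); the function names and the tank dimensions are made up.

```python
import math


def hole_heights(H, d):
    """The two heights h at which a hole sends the jet a distance d (Eqn. 20.25)."""
    if d > H:
        raise ValueError("the jet cannot reach farther than d = H")
    root = math.sqrt(H * H - d * d)
    return (H - root) / 2, (H + root) / 2


def jet_range(H, h, g=9.8):
    """Horizontal distance covered by a jet leaving a hole at height h."""
    v2 = math.sqrt(2 * g * (H - h))   # Eqn. 20.21
    t = math.sqrt(2 * h / g)          # time to fall a height h
    return v2 * t


# Hypothetical tank filled to 1.0 m, bowl 0.6 m away.
low, high = hole_heights(H=1.0, d=0.6)
print(low, high, jet_range(1.0, low), jet_range(1.0, high))
```

Both holes reach the bowl: the low one with a fast jet and a short fall, the high one with a slow jet and a long fall.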

The next example is the atomizer in the left half of Figure 20.9. There
is some perfume in the container at the atmospheric pressure $P_A$. You now
squeeze the bulb, which sends a jet of fast-moving air over the nozzle at the
top at some velocity $v_2$. This lowers the pressure there below $P_A$ and sucks
the perfume out through the nozzle and blows it on your face.

Figure 20.9 The atomizer and Venturi meter.

The last application of Bernoulli is the Venturi meter shown in the
right half of Figure 20.9. Suppose that oil of density $\rho_o$ is flowing in a pipe,
and you want to know the flow rate in m$^3$/s. It is of course $A_1 v_1$, but you
know only $A_1$ and not $v_1$. So you create a constriction of area $A_2$ in the flow
and then hook up a U-tube as shown in the figure. The U-tube has some
other fluid of density $\rho_f > \rho_o$ that does not mix with oil. Because of the
continuity equation $A_1 v_1 = A_2 v_2$, the oil will be moving faster in the
constriction and the pressure there will be less. This difference is easily related
to the height $h$ shown in the figure as follows. Start at the solid horizontal
line where the pressures must be equal in the two legs to some $P_0$ because
they are in a static fluid and at the same height. The pressures $P_1$ and $P_2$
at the middle of the pipe (dotted line) are then $P_0$ minus the corresponding
$\rho g h$ terms. Canceling a common term on both sides due to the oil above
the height $h$, we get


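$$P_1 - P_2 = (\rho_f - \rho_o) g h. \qquad (20.26)$$


Thus the equations we have are

$$P_1 + \frac{1}{2}\rho_o v_1^2 = P_2 + \frac{1}{2}\rho_o v_2^2 \qquad (20.27)$$
$$A_1 v_1 = A_2 v_2 \qquad (20.28)$$
$$P_1 - P_2 = (\rho_f - \rho_o) g h, \qquad (20.29)$$

which may be solved to give

$$v_1 = \sqrt{\frac{2(\rho_f - \rho_o) g h}{\rho_o\left[(A_1^2/A_2^2) - 1\right]}}. \qquad (20.30)$$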

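The Venturi formula of Eqn. 20.30 can be sketched in a few lines. The function name `venturi_speed`, the densities (an oil of 850 kg/m^3 and mercury as the U-tube fluid), and the pipe dimensions are all assumptions chosen for illustration.

```python
import math


def venturi_speed(h, area1, area2, rho_oil, rho_fluid, g=9.8):
    """Speed v1 in the wide section from the U-tube height difference h (Eqn. 20.30)."""
    if area2 >= area1:
        raise ValueError("the constriction must be narrower than the pipe")
    if rho_fluid <= rho_oil:
        raise ValueError("the U-tube fluid must be denser than the oil")
    numerator = 2 * (rho_fluid - rho_oil) * g * h
    denominator = rho_oil * ((area1 / area2) ** 2 - 1)
    return math.sqrt(numerator / denominator)


# Hypothetical meter: the U-tube shows h = 5 cm, and the constriction has half the area.
rho_oil, rho_fluid = 850.0, 13600.0
area1, area2, h = 0.02, 0.01, 0.05
v1 = venturi_speed(h, area1, area2, rho_oil, rho_fluid)
v2 = v1 * area1 / area2
flow_rate = area1 * v1  # m^3/s

# Consistency check: the Bernoulli pressure drop should match the U-tube reading.
drop_from_bernoulli = 0.5 * rho_oil * (v2**2 - v1**2)
drop_from_u_tube = (rho_fluid - rho_oil) * 9.8 * h
print(v1, flow_rate, drop_from_bernoulli, drop_from_u_tube)
```
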

CHAPTER 21 


Heat 


21.1 Equilibrium and the zeroth law: temperature 


This chapter—devoted to the study of heat, temperature, and heat transfer— 
sets the stage for our study of thermodynamics. 

You already have an intuitive notion of temperature. Let us begin here 
with what may be new: the notion of thermal equilibrium. Systems are said 
to be in thermal equilibrium when their macroscopic properties, properties 
discernible by macroscopic probes like the naked eye or a thermometer, 
have stopped changing. 

Take a cup of hot black coffee and take another cup of cold pink soda, 
and keep them both thermally isolated from each other and the outside 
world. No matter how long you wait, the coffee will be hot and uniformly 
black and the soda cold and uniformly pink. The coffee and soda are in 
their respective states of thermal equilibrium. 

If you now pour the contents of one cup into the other, there will 
be a period when the system is not in equilibrium in the sense that it 
doesn’t have a well-defined temperature or composition. For example, if 
you just poured the coffee, the hot black coffee is initially on the top and 
the cold pink soda at the bottom. There will be a period of transition when 
you really cannot even say what the temperature of the mixture is. Some 
parts are hot, and some parts are cold; the system doesn’t have a global 
temperature, and it is changing at a macroscopic scale. It also does not 
have a uniform composition; some parts are black, some are pink, and 
some are in between. But if you wait long enough, until the two parts have
gotten to know each other, they will turn into some undrinkable lukewarm 
mess, but the nice thing about the mess is that it will have a well-defined 
temperature and uniform composition. Further waiting will not change 
anything. That’s again a system in equilibrium. 

Here’s another example. Suppose you take a gas and you put it inside 
a cylinder with a massless piston of area A on top, as in the left half of 
Figure 21.1. You put on some weights mg, and they exert a pressure P = 
mg/A. (We assume there is no atmosphere outside pushing down on the 
cylinder. We also ignore the pressure pgh due to the mass of the gas because 
the gas is so light.) We say that it’s in equilibrium because the macroscopic 
things, things you can see with your naked eye, are not changing. It’s just 
going to sit there. But if you suddenly remove a third of the weights, the 
piston is going to shoot up, shake around a little bit, and then settle down 
in a new location in a new state of equilibrium after some time. In between 
these two states of equilibrium you will see the piston moving, the gas 
turbulent with eddies and vortices, and the pressure high in some regions, 
but low in other regions. These are non-equilibrium states of the system. 

Here is a summary of the difference between the microscopic and
macroscopic. At the microscopic level the atoms and molecules that form
the liquid or the gas are always in well-defined states at every instant,
even if the system is not in equilibrium. Each molecule has a definite
location and velocity, which determine its future in accordance with Newton’s
laws. But, at a macroscopic level, when you don’t look into the fine details,
global attributes like temperature and pressure don’t have a well-defined
value away from equilibrium. Conversely, in equilibrium, although
macroscopic attributes like temperature and pressure appear fixed and uniform,
at a molecular level there is a lot of motion and activity.

Figure 21.1 Left: The gas is inside the cylinder capped by a piston of area $A$. The
weights on the piston exert a force $F$ and pressure $F/A$. Right: A thermometer with
a liquid that expands from the bulb into the thin calibrated stem.

Now, whenever a system is in equilibrium, we can assign to it a
temperature that we call $T$. Right now we don’t know anything about this
temperature, other than your intuitive feeling for it. So we’re going to build
it up from scratch.

Let us begin with the zeroth law of thermodynamics, so called because
after the first two laws were codified, it was realized that there was a
notion even more basic. The zeroth law says, “If A and B are at the same
temperature, and B and C are at the same temperature, then A and C are 
at the same temperature.” 

You are probably asking: “You call this a law?” Yes, it is the key to our 
being able to speak about temperature globally. It is the assumption that if 
I use a thermometer to measure something at one place and then dip the 
thermometer into something that is elsewhere, and it reads the same
number, then I may conclude these two entities, which never met each other
directly, are also at the same temperature. That means if I bring them into
thermal contact they will continue to be in equilibrium. That seems pretty 
obvious to you, but the whole notion of temperature is predicated on the 
fact that you can define an attribute that you can globally compare between 
two systems that never met directly, but met a third common system. 


21.2 Calibrating temperature 


Once we have some idea of hot and cold, we wish to be more quantitative. 
Describing somebody as tall or short is a good start but we usually want 
to know how tall—how many feet, how many inches. The need for more 
quantitative information led to the idea, “Let’s find things in the world 
that seem to vary with temperature and use that variation to quantify 
temperature, and build thermometers.” 

Here is an example. You take this metallic meter stick at the National 
Bureau of Standards, kept in some glass case, at some temperature. You 
make a duplicate of it, and you lay it outside the glass case. What you 
find is that if the room is hotter than inside the case, the stick outside it 
expands to a new length. It contracts when the room is colder. Nothing has
been done to the stick in the air-conditioned glass case, so this one outside 
must be expanding and contracting. One way to define temperature is to 
correlate the length of the stick with temperature in any unique fashion. 
For example, the temperature could be the length of the stick in centime- 
ters, measured by comparison with the one in the glass case. However, if 
this metallic stick is to be a portable thermometer, we need to compare 
it to something portable that does not expand or contract when heated or 
cooled. We could take a meter stick made of wood and notice that its length 
matches the one in the glass case no matter how hot or cold the room is. 
We could then carry this wooden meter stick and a metallic one and use 
the difference in lengths as a measure of temperature. 

The very first thing we can do is to use the zeroth law to say when two 
things are at the same temperature. You successively dip the metallic stick 
in two buckets containing two fluids, giving it enough time to equilibrate. 
If it ends up having the same length (as measured by the wooden stick), 
you say the fluids are at the same temperature. You could go a step further 
and declare that if the lengths do not match, the greater length corre- 
sponds to the hotter fluid. To specify how much hotter, we may say the 
length of the metallic stick in centimeters (as measured by the wooden 
stick) is the temperature. You may not be used to measuring temperature 
in centimeters, but that unit of measurement is just as good as degrees, 
just as 760 millimeters (of mercury) is an accepted measure of atmospheric 
pressure. 

In practice one picks something a little easier than this metal stick— 
wooden stick combination. We know liquids expand when you heat them. 
If you fill your gas tank on a hot day, you have to leave some room at the top 
for the expansion. One way to measure temperature is to take some liquid, 
put it in a jar, mark the level, and watch the liquid expand or contract to 
a new height and mark that height. (Here we neglect the expansion of the 
jar itself and need to verify that is valid by other means.) You can associate 
each marking with a certain temperature. This mark can be zero, that can 
be 5, that can be 19; you have to make sure that it’s monotonic, so that 21 is 
hotter than 19 in some objective sense, say by noting that some substance 
melts at 21 but not 19. 

A more practical design for the thermometer is shown in the right 
half of Figure 21.1. You have a lot of fluid in a big reservoir or bulb, con- 
nected to a very thin stem evacuated at the top. What’s clever about this is 
that even if the liquid expands by one percent in volume, it can climb up 
the narrow stem quite a bit, that is, a little bit goes a long way, because the
stem is so narrow. In fact, the stem is so narrow, it is embedded inside a 
prism that magnifies the mercury or alcohol column so you can see it with 
the naked eye. 

Next we want to design thermometers so that people in different 
parts of the world, different countries, different labs, can all agree. So, we 
will make it possible for everyone to make his or her own thermometer 
by the following recipe. We will dip this thermometer in a bucket that 
contains some ice and water in equilibrium. Coexistence of ice and wa- 
ter seems to occur only at one temperature. That’s called the melting point 
of ice or the freezing point of water. We go to the lake where ice and water 
coexist, we dip the thermometer there, and we find that the reading does 
not change as long as ice and water coexist. It changes only when all the 
water has turned into ice or vice versa. Whatever reading we get we will 
postulate to be 0 degrees centigrade or 0°C. That is just a definition. In 
the thermometer in Figure 21.1, we mark the point on the stem as 0°C. 
But right now you can only tell if another object is at, above, or below 0°C 
using this thermometer with just one marking. 

So we need to find another universally accessible thing, which, as 
you all know, is the boiling point of water. If you take water in an open
pot and put it on a stove, it heats up and heats up and then
begins to boil and evaporate. As long as water and steam coexist, the
temperature remains fixed, as you can tell because the mercury in the 
thermometer does not rise. That coexistence temperature is going to be 
called 100 degrees centigrade or 100°C. You stick your thermometer in 
when this happens and mark the height of the liquid as 100°C. Then, you 
take the interval on the stem between the 0 and 100 markings (Figure 
21.1), and you divide it into 100 equal parts. Because the markings can 
continue outside this interval, this unit defines the temperature outside 
as well. (This is like saying that a meter stick can be used to measure 
lengths more or less than a meter.) If the liquid has gone 79% of the way
from the 0 mark to the 100 mark, the temperature is 79 degrees. That
is the centigrade scale. You know there are different scales. You can have 
the Fahrenheit scale, or any other scale in which what you want to call the 
freezing point is a different number. Somebody thinks it’s zero; another 
person thinks it’s 32. You can again assign to the boiling point a differ- 
ent number: 100 or 212, and you can divide this interval into 100 parts, 
180 parts, whatever you like. But the philosophy has the same three steps. 
You find two points that are reproducible conveniently, divide the region 
between them into some number of equal steps, and name the scale after
yourself. 
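
The three-step recipe amounts to a linear interpolation between two marks on the stem. Here is a minimal sketch, assuming the column height can be read in centimeters; the function name `reading_from_column` and the positions of the marks are hypothetical.

```python
def reading_from_column(height, height_at_freezing, height_at_boiling,
                        freezing_label=0.0, boiling_label=100.0):
    """Temperature assigned to a column height by dividing the stem into equal steps."""
    fraction = (height - height_at_freezing) / (height_at_boiling - height_at_freezing)
    return freezing_label + fraction * (boiling_label - freezing_label)


# Suppose the ice-water mark sits at 2.0 cm and the boiling mark at 12.0 cm.
# A column at 9.9 cm has gone 79% of the way from one mark to the other.
centigrade = reading_from_column(9.9, 2.0, 12.0)
fahrenheit = reading_from_column(9.9, 2.0, 12.0, freezing_label=32.0, boiling_label=212.0)
print(centigrade, fahrenheit)
```

The same column height gives different numbers on the two scales only because the two fixed points were given different labels; heights outside the marked interval work in the same way.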

Now, there are some problems with this approach. One is that the 
boiling point of water does not seem to be a very reliable standard. If I boil 
water at high elevations, like in Aspen, for example, it doesn’t seem to be 
as hot as it is at sea level. I know that because when I cook rice, I find it 
doesn’t cook fully when the water is boiling in Aspen, but it does when the 
water is boiling in New Haven. I know it’s boiling earlier in the mountains 
than in the plains, based on a physical phenomenon, the cooking of rice. A 
thermometer calibrated at sea level will not agree with one on the moun- 
tains. So, who’s going to decide what the real boiling temperature is: the 
person on the plains or on the mountain? You have to be careful when you 
say boiling point and freezing point, because these are not independent of 
altitude, pressure, et cetera. 

Nowadays, people have much fancier definitions for calibration, and 
I will tell you a little bit about that later. But, for now, don’t worry about 
the fact that water boils differently at different altitudes; we could all agree 
to calibrate at sea level, and the relevant conditions at sea level are pretty 
much constant all over the world. 

Now we will turn to a far deeper problem that exists even after you 
have figured out reliable and reproducible boiling and freezing points. If 
you make a thermometer with your favorite fluid, say mercury, and I make 
one with alcohol, they will agree at 0 and they will agree at 100 because 
that’s how we fixed it. We rigged it so at 0 everyone says 0, at 100 ev- 
eryone says 100. But how about 75 degrees? I say it’s 75, if my fluid has 
climbed three-fourths of the way to the top. At that point, yours may not 
have climbed three-fourths of the way. In other words, we have the two 
graphs shown in Figure 21.2. Along the x-axis, we measure temperature 
according to, say, the mercury thermometer. Along the y-axis we plot the 
temperatures according to another thermometer, say, alcohol. The straight 
line y = x is provided for comparison and simply equals the reading on 
the mercury thermometer. The two agree at 0 and 100 by definition. The 
curved one does not agree with the reference in between. When the mer- 
cury thermometer says it is 75, the other may say it is 55. You will have 
to pick one liquid and say, “We swear by that liquid, and when that liq- 
uid’s gone halfway toward 100, we'll say it’s 50 degrees.” To pick a liquid 
you'll have to have an international convention, and before you know it, 
there will be an argument between the alcohol lobby and the mercury 
lobby. 

Figure 21.2 The x-axis is labeled by the reading of a reference thermometer, say 
with mercury. Along the y-axis are plotted the readings of another thermometer, 
say one filled with alcohol (curved line), along with a straight line y = x, which 
is just the reading of the same mercury thermometer plotted for easy comparison. 


What if I can show you hundreds of thermometers that agree with 
each other all the way? Clearly we should go with the hundreds that agree. 
Who are these guys? They are the gas thermometers. Here is the explanation 
of how they work. It takes some effort, but it’s worth it. 

First I take a small quantity of gas inside a cylinder at some pressure 
P. The volume V is just the volume up to the piston. This is my reference 
thermometer. I put it inside boiling water until it reaches equilibrium and 
then write down the value of the product $[PV]_0$, where the subscript 0 
tells us it is the reference gas. I repeat with the tub of coexisting water 
and ice, and I keep track of the product again. Now I label points on the 
x-axis by the product (see Figure 21.3). To the place on the x-axis 
corresponding to freezing, I assign the value 0°C and I assign the value 
100°C to the boiling point. I divide the interval between the ice value and 
boiling value into 100 equal parts. This defines the centigrade scale 
according to this gas thermometer. To find the temperature of a body, I place 
my gas thermometer against it, wait for equilibrium, and measure the product 
$[PV]_0$. Whatever number on the centigrade markings this product falls on, 
that is the temperature in centigrade. Thus the point $[PV]_0 = 350$ in the 
figure corresponds to the temperature 75°C. I can continue this scale to the 
left of the ice point and to the right of the boiling point. 
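
The two-point recipe just described can be sketched in a few lines of Python. This is only an illustration: the function name and the calibration readings (200 at the ice point, 400 at the boiling point, in arbitrary units) are made up for the example and are not read off Figure 21.3.

```python
def celsius_from_pv(pv, pv_ice, pv_boil):
    """Map a PV reading of the reference gas to degrees centigrade.

    The ice point is assigned 0 and the boiling point 100; the interval
    between the two PV values is divided into 100 equal parts.
    """
    return 100.0 * (pv - pv_ice) / (pv_boil - pv_ice)


# Hypothetical calibration readings in arbitrary units (assumed values).
PV_ICE = 200.0
PV_BOIL = 400.0

print(celsius_from_pv(350.0, PV_ICE, PV_BOIL))  # three-fourths of the way up
print(celsius_from_pv(150.0, PV_ICE, PV_BOIL))  # to the left of the ice point
print(celsius_from_pv(450.0, PV_ICE, PV_BOIL))  # to the right of the boiling point
```

With these assumed readings, a product of 350 sits three-fourths of the way from the ice value to the boiling value, so it is labeled 75°C; the same formula extends the scale below 0 and above 100.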

Figure 21.3 Comparison of different gas thermometers labeled 0, 1, and 2. 
The x-axis is labeled by thermometer 0, and the other two lines correspond to 
two other gas thermometers, possibly filled with different dilute gases. The 
linearity of the graphs guarantees that the thermometers will agree not only at 
0 and 100, where they have to, but also everywhere in between, given the 
similarity of triangles abd and ace. When $[PV]_0 = 350$, the temperature is 75°C. 

I now consider two more gas thermometers with different amounts 
of gases of different kinds, and I plot their PV as a function of my 
centigrade scale (based on $[PV]_0$) measured along the x-axis. Two such lines 
are shown in the figure. The main point is that they are both straight lines. 
This guarantees that the thermometers will agree not only at the end points, 
where they have to, but also in between and beyond. Let us understand 
why. Look at the lower one labeled 2 and the shaded right triangle that 
extends from 0 to 100 in the x-direction and the corresponding values of 
$[PV]_2$ in the y-direction. The value of $[PV]_2$ grows linearly between the 
ice and boiling points, as shown by the hypotenuse of the shaded triangle. 
Of course, no one wants to know what $[PV]_2$ is; we want the corresponding 
temperature in centigrade. This means we must associate with each 
value of $[PV]_2$ a certain temperature. By convention we label the ice point 
as 0°C and the boiling point as 100°C. Next we divide the difference in 
$[PV]_2$ between freezing and boiling into 100 equal parts, and each one is 
one degree centigrade, as shown in the vertical side of the triangle. This 
way if $[PV]_2$ rises by 75% of the difference in $[PV]_2$ between the ice and 
boiling points, the temperature reading according to thermometer 2 will be 
75°C. But this will also be the temperature of the corresponding point on 
the x-axis, for it too would be 75% of the way between 0 and 100 by similar 
triangles abd and ace. By the same logic, thermometer 1, which uses $[PV]_1$ 
as the measure of temperature, will also agree with the other two 
everywhere. Indeed, any gas thermometer whose measure of temperature is via 
PV will be represented by a linear plot, and it will agree with its fellow gas 
thermometers everywhere. The slope of the plot will vary with the kind 
and amount of gas (for example, doubling the volume of gas at the same 
pressure will double the PV and the slope), but that will only change the 
spacing in PV per degree centigrade. 
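
To see the similar-triangles argument at work, here is a small Python sketch. Everything in it is an assumption made for illustration: the helper names, the raw readings at the fixed points, and in particular the quadratic shape chosen for the "curved" thermometer, which merely stands in for a liquid that does not track the reference linearly.

```python
def calibrate(x_ice, x_boil):
    """Return a function mapping a raw reading x to degrees centigrade."""
    def to_celsius(x):
        return 100.0 * (x - x_ice) / (x_boil - x_ice)
    return to_celsius


def straight(x_ice, x_boil):
    """Raw reading that is a straight line in the reference temperature t."""
    return lambda t: x_ice + (x_boil - x_ice) * t / 100.0


def curved(x_ice, x_boil):
    """Raw reading that bends between the fixed points (made-up quadratic)."""
    return lambda t: x_ice + (x_boil - x_ice) * (t / 100.0) ** 2


def reading(response, t):
    """What a thermometer calibrated at 0 and 100 shows at reference temperature t."""
    to_celsius = calibrate(response(0.0), response(100.0))
    return to_celsius(response(t))


gas_1 = straight(300.0, 410.0)   # hypothetical PV values at ice and boiling
gas_2 = straight(120.0, 164.0)   # a different amount of a different gas
liquid = curved(10.0, 30.0)      # hypothetical column heights

for t in (0.0, 50.0, 75.0, 100.0, 150.0):
    print(t, reading(gas_1, t), reading(gas_2, t), reading(liquid, t))
```

The two straight-line thermometers return the reference temperature at every point, up to floating-point rounding, whatever their slopes. The curved one agrees only at 0 and 100, which is the disagreement sketched in Figure 21.2.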

The basic point is that a straight line is completely determined by any 
two points on it. 

It is essential that the x-axis itself be calibrated by a gas thermometer. 
If there were a thermometer that disagreed with the gas thermometer at 
intermediate points, and we used that to label the x-axis, the PV plots of 
all the gas thermometers would be curved. 

There appears to be just one requirement for a good gas thermome- 
ter: the gas has to be very dilute. Given this requirement, everybody can 
build and use the gas thermometer. 


21.3 Absolute zero and the Kelvin scale 


Comparison of different gas thermometers (Figure 21.3) reveals not only 
that each gas thermometer is represented by a straight line, but that all 
these lines cross the x-axis at −273.15°C. (In reality, the PV plots stop 
short of 0, and one extrapolates linearly to see where they would hit zero.) 
There is something very special about that temperature because it is shared 
by all gases. It is called the absolute zero of temperature. It’s called abso- 
lute zero for many reasons. One is that, unlike the 0° centigrade, which 
is by no means the absolute lowest possible temperature, the absolute 0 
is the lowest possible temperature. Why? Because the gas pressure can’t 
be reduced below zero. That’s it. It cannot go below having no pressure. 
Later, we'll find reasons why no further cooling is even conceptually possi- 
ble. That will require you to understand what hot and cold mean at the 
atomic level. But right now, Figure 21.3 shows that all gas thermome- 
ters point to this temperature. So people decided that calling the freezing 
point of water 0 is artificial. That’s based on human preoccupation with 
water. But if you think laws of physics describe the whole universe, what 
about planets where there is no water? Suppose you're talking to a differ- 
ent civilization: Planet of the Apes, say. You tell them, “Apes, we’re going 
to sync our temperatures; 0 is when water freezes,” and they say, “What is 
this thing called water?” You cannot identify water as the stuff they drink, 
because you don’t know what these apes are drinking. Maybe they drink 
methane or liquid hydrogen. On the other hand, if you say, “Take any gas 
and wait until the product of the pressure and volume goes to zero, and then 
call that zero,” that will be a universal standard. It’s not tied to something 
called water. The absolute zero is referred to as 0 K, where K stands for 
(Lord) Kelvin. 
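
The linear extrapolation to PV = 0 can be tried numerically. The sketch below assumes an ideal-gas model in which PV is proportional to the amount of gas times the temperature measured from absolute zero, with absolute zero placed at −273.15°C (the modern value). That number is put in by hand, so the code only illustrates that the crossing point does not depend on the slope; it is not a measurement.

```python
def zero_crossing(pv_ice, pv_boil):
    """Centigrade temperature where the straight line through
    (0, pv_ice) and (100, pv_boil) extrapolates to PV = 0."""
    slope = (pv_boil - pv_ice) / 100.0   # PV per degree centigrade
    return -pv_ice / slope


def ideal_gas_pv(amount, t_celsius, t_abs_zero=-273.15):
    """Assumed model: PV proportional to amount * (t - t_abs_zero)."""
    return amount * (t_celsius - t_abs_zero)


# Different amounts of gas give lines of different slope (arbitrary units).
for amount in (0.5, 1.0, 3.0):
    pv_ice = ideal_gas_pv(amount, 0.0)
    pv_boil = ideal_gas_pv(amount, 100.0)
    print(amount, zero_crossing(pv_ice, pv_boil))
```

Only two readings per thermometer are used, because a straight line is fixed by two points. Every amount of gas gives the same intercept, within rounding.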

Having picked the zero, we need one other temperature to form a 
scale. It was decided the triple point of water would be assigned a value 
273.16K. What’s the triple point of water? Water and ice can coexist along 
a line in the P-T plane, and water and steam can coexist along another 
line. There is a certain magical point at which the lines meet, and there 
ice, water, and steam can coexist. The system cannot choose among those 
three options. We will assign to that point a temperature +273.16 K in 
the Kelvin scale. (For our purposes the triple point is essentially the freez- 
ing point, which we called “Freeze” in the figure, at a particular pressure.) 
The centigrade and Kelvin scales differ only in the location of the zero; a 
change of 1° in the Kelvin scale is also a change of 1° in the centigrade scale. 
The difference between absolute zero and the triple point of water is 273.16 
degrees in both scales. 

Now there is a rule. (I do not say “unwritten” for it is written.) You 
may say, “50 degrees centigrade,” but you're not supposed to say, “50 de- 
grees Kelvin.” You have to say “50 Kelvin.” I keep forgetting this but so far 
nothing terrible has happened to me. But you should remember this rule 
when you take the GRE or go for a job interview. Once you get tenure, 
you too can say “50 degrees Kelvin,” whenever you want, with no dire 
consequences. 


21.4 Heat and specific heat 


Heat is denoted by the symbol Q. What are we talking about when we 
talk about heat? Again, let’s use your intuition. Say we have a bucket of 
water and we want to heat it up. We put the bucket on top of a stove that 
we think is hotter than the water, and when the two are brought together, 
somehow the water begins to feel hotter and hotter. We say we’ve heated 
the water, and we say we have transferred heat. Now, scientists were not 
always sure what really was being transferred. What goes from the stove to 
the water? Why is it that the stove, if it’s not plugged in, gets cooler as the 
water gets hotter? Some theorists imagined there was a certain caloric fluid 
that is abundant in hot things, and not so abundant in cold things. When 
we put hot and cold together, this fluid flows from hot to cold, and in the 
process heats the cold thing. It was decided to measure heat transferred in 
calories. So next we have to define a calorie. You want to ask, “How much 
heat does it take to raise the temperature of this bucket of water by ten 
degrees?” Here is the rule: The number of calories you need is equal to the 
mass of water in grams times the change in temperature: 


$$\Delta Q = m\,\Delta T. \tag{21.1}$$


In other words, if you had 4 grams of water, and you did something 
to it and the temperature went up by 7 degrees, you have, by definition, 
pumped in 28 calories. If this were 4 kilograms of water, this would be 
28 kilo-calories. Sometimes we use grams and calories; sometimes we use 
kilograms and kilo-calories. But the definitions are consistent: if you put a 
kilo in the gram, put a kilo in the calories. 

Now, suppose you want to talk about heating something else, say a 
gram of copper. Then you write down the following rule. The amount of 
heat it takes to heat up anything must be proportional to the amount of 
stuff you're trying to heat up. That’s our intuitive notion. If you have one 
chunk of gold that takes some number of calories, and you have a second 
identical chunk, that should take the same number of calories. If you 
put them together, it is clear that whatever this caloric fluid is, you will 
need to double what was required by one chunk. So, $\Delta Q$ has to be 
proportional to the mass of the substance. And it’s got to be proportional to 
what you're aiming for, namely, increase in temperature. But this is true 
for any substance, whether you're heating copper, wood, or gold. No matter 
what you are heating, the heat needed is proportional to mass and to 
the change in temperature. So, what is it that distinguishes one material 
from another? We introduce the number c here, called the specific heat, and 
write 


$$\Delta Q = m c\,\Delta T. \tag{21.2}$$


The specific heat c is a property of the material, and we will soon 
discuss how it is to be measured. Equation 21.1 tells us c = 1 cal/(g·K) for 
water: 


$$c_w = 1\ \text{kcal}/(\text{kg}\cdot\text{K}) = 1\ \text{cal}/(\text{g}\cdot\text{K}). \tag{21.3}$$
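
As a quick check of the bookkeeping, here is a minimal Python sketch of the rule that heat equals mass times specific heat times temperature change. The function name is invented for the example, and the specific heat used for copper (about 0.092 cal/(g·K)) is an approximate handbook value that does not come from the text.

```python
C_WATER = 1.0     # cal/(g K), by the definition of the calorie
C_COPPER = 0.092  # cal/(g K), approximate handbook value (assumption)


def heat_needed(mass, delta_t, c=C_WATER):
    """Heat = m * c * delta_T.

    With mass in grams and c in cal/(g K) the result is in calories;
    with mass in kilograms and c in kcal/(kg K) it is in kilocalories.
    """
    return mass * c * delta_t


print(heat_needed(4, 7))               # 4 g of water warmed by 7 degrees
print(heat_needed(4, 7, c=C_COPPER))   # the same mass of copper needs far less
```

The first call reproduces the 28 calories of the example above; the dependence on mass and temperature change is the same for both materials, and only c differs.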


You have to understand that formulas will depend on certain parameters 
in a generic way, and on other things in a material-specific way. In 
this example the dependence on m and $\Delta T$ is generic, while that on c is 
specific to some material. 

Here is another example of this logic. I can ask by how much a rod 
will expand if I heat it by $\Delta T$. The increase $\Delta L$ has to be 
proportional to the original length. To see this, take a meter stick that 
expands by some amount and put another identical meter stick next to it. The 
two-meter stick will clearly expand by twice as much as the one-meter stick. 
So, we put the length L in the right-hand side. There is, of course, the 
$\Delta T$, which causes the expansion. No matter what you are heating, a 
block of wood or a block of steel, it is true that 


$$\Delta L \propto L\,\Delta T. \tag{21.4}$$


But then the fact that heat has different effects on copper versus wood 
is indicated by introducing a number $\alpha$, called the coefficient of linear 
expansion, which depends on the material, to obtain 


$$\Delta L = \alpha L\,\Delta T. \tag{21.5}$$


So copper will have a certain $\alpha$, iron will have a different $\alpha$, and 
so on. Suppose you say, “Well, I had some material and when I heated 
it up by one degree, its length increased by nine inches; another one in- 
creased by two inches.” Is it clear that the first one expands more readily? 
No, because the first one could have been a mile long, and the second 
one could have been ten feet long. So, you have to take out certain fac- 
tors that are universal, and the rest of it you put into a property of the 
material. 
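
The nine-inch versus two-inch puzzle can be settled by dividing out the universal factors. The Python sketch below takes the lengths from the example (a mile and ten feet) and solves the expansion formula for the coefficient; the function name is made up for the illustration.

```python
INCHES_PER_FOOT = 12
INCHES_PER_MILE = 5280 * INCHES_PER_FOOT


def expansion_coefficient(delta_l, length, delta_t):
    """Solve delta_L = alpha * L * delta_T for alpha (per degree)."""
    return delta_l / (length * delta_t)


# First material: a mile long, grows 9 inches when heated by one degree.
alpha_first = expansion_coefficient(9, 1 * INCHES_PER_MILE, 1)
# Second material: ten feet long, grows 2 inches when heated by one degree.
alpha_second = expansion_coefficient(2, 10 * INCHES_PER_FOOT, 1)

print(alpha_first, alpha_second, alpha_second / alpha_first)
```

Once the original length is divided out, the second material turns out to expand far more readily per unit length, even though its absolute increase is smaller.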

Given that water has (by definition) a certain specific heat $c_w = 1$ 
kcal/(kg·K), I can measure the specific heat of other materials as follows. 
I take a container with some water in it. Let’s assume the container 
has no mass, so I don’t have to worry about it. The water of mass $m_2$ is at 
some initial temperature $T_2$. I have some new material, lead, and I want to 
find its specific heat. So, I take the lead of mass $m_1$ in the form of 
pellets, I heat the pellets to some temperature $T_1$, and I drop these 
pellets into this water. When I put the pellets into the water, there will be 
a period when the temperature is not defined. Soon the water and pellets will 
settle down to some common, final, equilibrium temperature called $T_f$, and 
the combined system will have a mass $m_1 + m_2$. 

Figure 21.4 Calorimetric problem: What will be $T_f$? 

We will now postulate that the total change in Q is zero. In other words, 
if heat Q is lost by one body and gained by another body, the loss and 
the gain must be equal. It’s a new law, called the conservation of heat. You 
can make up all the new laws you want. You don’t know if they’re right, 
but this is the law you first make up. What can you say in this particular 
problem? I urge you to draw the picture in Figure 21.4 for such calorimetric 
problems. It shows the masses $m_1$ (lead) and $m_2$ (water) at temperatures 
$T_1$ and $T_2$ before, and their final state at $T_f$ after they are 
combined. The sum of all the $\Delta Q$'s is zero: 


$$\Delta Q_1 + \Delta Q_2 = m_1 c_1 (T_f - T_1) + m_2 c_2 (T_f - T_2) = 0. \tag{21.6}$$


Note that the $\Delta Q$ for the water will be positive because $T_f > T_2$ 
(water gains heat) while the lead pellets, which lose heat, will have a 
negative $\Delta Q$ because $T_f < T_1$. 

The mass of the water is $m_2$, the mass of the pellets is $m_1$, and 
$c_2 = c_w = 1$ kcal/(kg·K). You can measure $T_1$, $T_2$, and $T_f$ and hence 
solve for $c_1$. 
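
Solving the heat-balance equation for the unknown specific heat is a one-line rearrangement, sketched here in Python. The masses and temperatures are hypothetical, and the value 0.03 kcal/(kg·K) used to generate a consistent final temperature is only an assumed, roughly lead-like number, not a measurement from the text.

```python
C_WATER = 1.0  # kcal/(kg K), by definition


def final_temperature(m1, c1, t1, m2, c2, t2):
    """Solve m1*c1*(Tf - T1) + m2*c2*(Tf - T2) = 0 for Tf."""
    return (m1 * c1 * t1 + m2 * c2 * t2) / (m1 * c1 + m2 * c2)


def specific_heat_from_mixing(m1, t1, m2, t2, t_f, c2=C_WATER):
    """Solve the same equation for c1, given the measured Tf."""
    return m2 * c2 * (t_f - t2) / (m1 * (t1 - t_f))


# Hypothetical experiment: 0.5 kg of hot pellets dropped into 0.2 kg of water.
m1, t1 = 0.5, 100.0   # pellets (kg, degrees C)
m2, t2 = 0.2, 20.0    # water (kg, degrees C)

t_f = final_temperature(m1, 0.03, t1, m2, C_WATER, t2)  # assumed c1 = 0.03
print(t_f)
print(specific_heat_from_mixing(m1, t1, m2, t2, t_f))   # recovers the assumed c1
```

The first function plays the role of nature, producing the final temperature you would read off the thermometer; the second is the calculation you actually do in the lab.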

It turns out that the specific heat of materials is not really a constant; 
it can vary with temperature and generally vanishes as $T \to 0$ K. There 
is a big industry calculating the specific heats of materials as a function 
of T, starting from atoms and quantum mechanics. We can treat them as 
constants over some limited range, say, near room temperature, which is 
roughly 300 K. 

None of the things treated as constants is ever truly constant. The 
previous description with constant specific heats was espoused before we 
even knew about atoms. Physicists were doing the best they could. They 
determined the specific heat of lead as above and the specific heat of gold 
the same way. Then, in a third experiment performed with lead and gold, 
they found empirically that with those values of specific heat the 
$\Delta Q$’s indeed added up to zero; that is, Q was conserved. But this was 
based on the best possible measurements of that epoch. In a later epoch, 
when more accurate measurements were performed over a bigger range 
of temperatures, it was found that the specific heat could not be taken as 
temperature-independent if calories were to be conserved in calorimetric 
experiments. 


21.5 Phase change 


Now for a twist. I take some ice at −30°C. I’ve gone to centigrade now so 
we can better relate to ice. I put the ice on some source of heat, a device 
that will pump in a fixed number of calories every second. As a function 
of time, I’m expecting the temperature of the ice to go up. Every second, I 
get some number of calories, and those calories are going to produce for 
me some $m c_i \Delta T$, where $c_i$ is the specific heat of ice. Remember 
the specific heat of ice is not the same as the specific heat of water. Even 
though ice is also made up of water molecules, the number of calories needed 
to heat one gram of ice by one degree is roughly half what it takes to heat 
one gram of water. Because m and $c_i$ are constants, $\Delta Q$ is 
proportional to $\Delta T$. The rate of temperature rise will be proportional 
to the rate at which the heat flows into the system. The temperature of the 
ice goes initially from −30°C to −20°C to −10°C and so on as in Figure 21.5. 
But once it hits 0, point A, it gets stuck. Although I know heat is coming 
in, the ice is not getting hotter. Then I notice that the ice is beginning to 
melt. There will be a period when, as I pump in calories, I don’t get any 
increase in temperature, but I get conversion of ice into water. This is 
called a phase change. A phase change occurs when a substance changes its 
atomic arrangement, in this case, from a regular array that forms a solid 
into a liquid whose atoms are free to run around. 

Figure 21.5 Different phases of water from ice below 0°C to steam above 100°C. 
As time t increases, a steady flow of calories first heats the ice below 0°C to A, 
where it melts at 0°C between A and B. The water then heats up between B and 
C, and then it evaporates at 100°C between C and D, before becoming 
superheated steam. 

An atom in a solid is like a kid in a classroom: you have an assigned 
seat but you can fidget in that seat. A liquid in a container is more like 
a playground with a fence; the atoms get to run around within a certain 
contained space. 

Return to the period when there is some water with chunks of ice 
floating on it, between A and B. Until all the ice is converted to water, 
the whole system is stuck at that temperature, 0°C. It is called “No ice 
left behind.” That’s a very interesting property. Now, if you really put a 
pot on a stove and drop a chunk of ice in it, you know what will hap- 
pen. The bottom of the ice will melt; it may even boil and evaporate, 
yet the top may still be solid. That’s not what I’m talking about, because 
that’s not a system where there’s a globally defined temperature. I want 
you to heat the ice ever so slowly; the minute you put in a little bit of calo- 
ries, you give it enough time for the whole system to share that heat, so 
that the whole system has one single common temperature. When you 
do this, you will find the temperature stays fixed until all the ice has 
melted. 

The caloric price you have to pay to bring about the melting is called 
the latent heat of fusion, L. It has a value of 80 kcal/kg for water, and it varies 
from substance to substance. 

Once everything has become water, and the point B is reached, the 
uniform system of water starts increasing in T. I guess you know the next 
stopping point. When you come to 100°C, point C, again it gets stuck 
until everything vaporizes, and beyond point D it is all steam. Then, you 
can have superheated steam, which is even hotter than 100°C. The caloric 
cost involved per kilogram in this phase change is the latent heat of 
vaporization, $L_v$. It is around 540 kcal/kg. That’s information I don’t 
carry in my head. 

So, if I give you some ice at −30°C and I give you a budget of 5,000 
calories, where will it end up? Look at Figure 21.5. You have to first spend a 
few calories going from below A to A at 0°C, and, if you have some calories 
left, you can start melting the ice; maybe you'll run out of stuff there and 
get stuck between A and B, with a mixture of ice and water. If you have 
still more calories at your disposal, you can melt it all (point B) and start 
heating the water and keep going till it is all steam, and heat that steam and 
so on. 
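
The budget problem is a matter of paying for one stage at a time, which the following Python sketch does. Several things in it are assumptions: the text does not say how much ice there is, so 100 grams is picked for the example; the specific heat of ice is taken as exactly half that of water; and the values used for steam (0.5 cal/(g·K)) and for the latent heat of vaporization (540 cal/g) are approximate handbook numbers.

```python
C_ICE = 0.5       # cal/(g K), "roughly half" that of water
C_WATER = 1.0     # cal/(g K)
C_STEAM = 0.5     # cal/(g K), approximate (assumption)
L_FUSION = 80.0   # cal/g
L_VAPOR = 540.0   # cal/g, approximate (assumption)


def spend_calories(mass, t_start, budget):
    """Follow Figure 21.5 for ice starting at t_start <= 0 (degrees C).

    Returns (state, temperature, converted_mass), where converted_mass is
    the mass melted so far (between A and B) or vaporized so far (between
    C and D), and 0.0 when no phase change is in progress.
    """
    if t_start > 0:
        raise ValueError("this sketch starts from ice at or below 0 C")

    need = mass * C_ICE * (0.0 - t_start)     # warm the ice up to point A
    if budget < need:
        return ("ice", t_start + budget / (mass * C_ICE), 0.0)
    budget -= need

    need = mass * L_FUSION                    # melt: A to B
    if budget < need:
        return ("ice + water", 0.0, budget / L_FUSION)
    budget -= need

    need = mass * C_WATER * 100.0             # heat the water: B to C
    if budget < need:
        return ("water", budget / (mass * C_WATER), 0.0)
    budget -= need

    need = mass * L_VAPOR                     # vaporize: C to D
    if budget < need:
        return ("water + steam", 100.0, budget / L_VAPOR)
    budget -= need

    return ("steam", 100.0 + budget / (mass * C_STEAM), 0.0)


# Hypothetical amount of ice: 100 g at -30 C with a budget of 5,000 calories.
print(spend_calories(100.0, -30.0, 5000.0))
```

For 100 grams, 1,500 calories go into reaching 0°C and the remaining 3,500 melt 43.75 grams, so the system gets stuck between A and B as a mixture of ice and water.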

The kinds of problems you can get are fairly simple most of the time. 
The only time when you can really get in trouble is in the following 
situation. Suppose I mix some water at +20°C and some ice at −40°C. What 
will be the end product? Now, this is a subtle problem. If you had water at 
+20°C and you added more water at +60°C, you could easily guess that it 
would end up as water at an in-between temperature, which you can easily 
calculate. Now it’s more subtle. The answer will depend on how much ice 
and how much water you have. If by water at +20°C you mean the Atlantic 
Ocean and by ice you mean a couple of ice cubes, we know what’s 
going to happen. These ice cubes are going to get clobbered; they’re going 
to melt. You will end up with all water. Then, you can easily calculate the 
final temperature $T_f$ of the water by writing down (all in centigrade) and 
solving 


$$0 = M_a c_w (T_f - 20) + M_i c_i \bigl(0 - (-40)\bigr) + M_i L + M_i c_w (T_f - 0) \tag{21.7}$$


$$\phantom{0} = M_a c_w (T_f - 20) + 40\,M_i c_i + M_i L + M_i c_w T_f \tag{21.8}$$


where $M_a$ and $M_i$ are the masses of the Atlantic and the ice cubes, $c_i$ 
and $c_w$ the specific heats of ice and water, and L the latent heat of 
melting. The first term is the $\Delta Q$ for the ocean as it drops from 20°C 
to $T_f$. The next is the $\Delta Q$ for ice as it goes from −40°C to 0°C, 
the next is the heat of melting for the ice, and finally the heat gained by 
the melted ice to become water at $T_f$. 

In general, given some ice below zero and some water above, you will 
not know in advance whether you will end up with all ice, a mixture at 0°C, 
or all water. You can first make the optimistic assumption that you will 
end up as all water at an unknown temperature $T_f$. Write your equations, 
including the heat it takes to melt the ice, and solve for $T_f$. If you get a 
positive answer you can use it, because the assumption behind the equation 
(that you heated up the ice, melted it into water, and then heated that water 
from 0 to the final $T_f$) is correct. But if you did the calculation 
and obtained a negative value of $T_f$, that result is inconsistent with the 
assumptions in the equation. Then, you can try something else; you can 
assume it is all ice at the end. If you think $T_f$ is down there, below A, 
then you've simply heated the ice from very cold ice to not-so-cold ice. 
You brought the water down from T > 0 to 0, sucking out the $mc\,\Delta T$ to 
do that, then you've taken out the latent heat of melting to turn the water 
at 0 into ice at 0, and then you've cooled that ice down to the final $T_f$ 
below A. Then, all those losses of the original water equal the gain of the 
initial ice. If, when you solve for $T_f$, you get a negative number, then 
you're okay. That will certainly be the case if I sprinkle two drops of water 
on a big iceberg; we know it’s going to end up as ice. 

But if I give you numbers that are wishy-washy, when it is not so 
clear whether ice will win or water will win, you may be forced to consider 
a third possibility if these two fail: you end up with some amount of water 
and some amount of ice at 0 degrees, between A and B. Then the question 
is not what the final temperature is—it is 0—but how much is ice and how 
much is water. 

You solve that as follows. Start with the water at some point between 
B and C, say at 25°C. You extract heat $\Delta Q = m_w c_w \cdot 25$ from 
that water to bring it to water at 0°C, which is the final equilibrium 
temperature. You start giving that heat to the ice, first to move it from 
below A to A. It does that by absorbing $m_i c_i \Delta T$, the mass of the 
ice times the specific heat of ice times $\Delta T$. If the ice was at −40°C, 
then $\Delta T = +40$. You certainly have enough heat to do that because you 
know that, in the end, any ice we have is at 0°C. Whatever remains of the heat 
you extracted from the water, you now use to melt the ice at the rate of 
L kcal/kg. The unmelted ice is the amount of ice at the end, and the rest of 
the mass is water. 
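
The three-way guessing game (all water, all ice, or a mixture at 0°C) can be written out as a short Python sketch. The function name and the sample masses are hypothetical, the specific heat of ice is taken as exactly half that of water, and the mixture branch also allows for the case in which some water freezes, which the discussion above does not spell out.

```python
C_ICE = 0.5      # kcal/(kg K), taken as half that of water
C_WATER = 1.0    # kcal/(kg K)
L_FUSION = 80.0  # kcal/kg


def mix(m_water, t_water, m_ice, t_ice):
    """Mix water at t_water > 0 with ice at t_ice < 0 (degrees C).

    Returns (t_final, mass_of_ice, mass_of_water) at equilibrium.
    """
    total = m_water + m_ice
    warm_ice = m_ice * C_ICE * (0.0 - t_ice)   # heat to bring the ice to 0 C
    cool_water = m_water * C_WATER * t_water   # heat released cooling water to 0 C

    # Attempt 1: assume everything ends up as water at t_f >= 0.
    t_f = (cool_water - warm_ice - m_ice * L_FUSION) / (total * C_WATER)
    if t_f >= 0.0:
        return (t_f, 0.0, total)

    # Attempt 2: assume everything ends up as ice at t_f <= 0.
    t_f = (cool_water + m_water * L_FUSION - warm_ice) / (total * C_ICE)
    if t_f <= 0.0:
        return (t_f, total, 0.0)

    # Attempt 3: a mixture at 0 C; leftover heat melts ice (if negative,
    # the deficit is made up by freezing some of the water instead).
    melted = (cool_water - warm_ice) / L_FUSION
    return (0.0, m_ice - melted, m_water + melted)


print(mix(1.0, 20.0, 0.1, -40.0))    # lots of water, little ice
print(mix(0.01, 20.0, 10.0, -40.0))  # a few drops on a big block of ice
print(mix(1.0, 25.0, 1.0, -40.0))    # wishy-washy case
```

In the last call the water gives up 25 kcal on its way to 0°C, 20 kcal of which warm the ice to 0°C, and the remaining 5 kcal melt only 1/16 of a kilogram, so the answer is a mixture.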


21.6 Radiation, convection, and conduction 


We now discuss different ways of transferring heat. 

Radiation is heat energy transfer from some body to another without 
the benefit of any intervening medium, like heat from the sun, which is re- 
ally electromagnetic radiation. Electromagnetic radiation doesn’t need air; 
it doesn’t need any medium to propagate. In fact, if it needed a medium 
like air, we would not get any heat from the sun because between the 
earth and the sun is mostly vacuum. Thus, if you were in front of one 
of these space heaters, with glowing red coils, and you were feeling warm 
and someone started pumping the air out of that room, your last thought 
would be, “Yeah, but I am still warm.” 

The second way of heat transfer is called convection, explained by the 
following example. You have water in a pan; you put it on a hot plate. Then, 
in the lower part of it, the water gets hot. When it gets hot it expands, 
and when it expands the density goes down; therefore, by buoyancy it will 
start rising. Remember, a chunk of water belongs in water. A chunk 
of something else with lower density will float to the top. But the point 
is, water doesn’t have a fixed density. If you heat it up, the density goes 
down, so the water downstairs has a lower density—like a piece of cork, 
it will rise to the top. When it rises to the top, the cold water with the 
higher density will fall down. So, you set up a current. Hot rises to the top 
and cold comes down. And this also happens in the atmosphere. On a hot 
day, the air next to the ground gets really heated up and it rises, and the 
cold air comes down, and this process creates thermal currents. So, here 
you're trying to equalize the temperature between a region that is cold and 
a region that is hot by the actual motion of some material. In radiation, you 
don’t have the medium transferring heat because a medium is not even 
present. In convection, the medium actually moves and by that process 
heat is transferred. 

The final form of heat transfer, the one I want to focus on more 
quantitatively, is conduction, which is something you've all experienced. 
Why does a skillet have a wooden handle? There is a simple reason: given 
that your body is at 98 degrees and the skillet is, say, 200 degrees, a 
steel handle would have heat flowing from the skillet to your hand. We 
want to understand the rate at which heat flows from the hot end to the 
cold end. 

I’m now going to introduce a new term called heat reservoir. A reser- 
voir is another body like you and me, except it’s not at all like you and 
me. It’s enormous. It is so big that its temperature cannot be changed. If 
you sit on it, you will fry and you'll evaporate, but its temperature will 
not change. No body is really a reservoir. If you drop an ice cube in the 
Atlantic, you'll lower the temperature of the Atlantic but only by a negli- 
gible amount. So, take the limit Atlantic goes to infinity, and you have a 
reservoir. Reservoirs have one label, namely, their temperature. The room 
in which you are sitting is a good approximation to a reservoir. If you put 
a cup of hot coffee here, we say it will come to room temperature. Actually, 
the room temperature rises a bit to meet that of the coffee, but this increase 
is negligible in practice. We can attribute to the room a temperature quite 
independent of bodies that go in and out of it. 

Figure 21.6 Heat flow from a hot reservoir to a cold one through a conducting 
rod of length L and cross-sectional area A. 

Look at Figure 21.6. A rod of length L and cross section A is connected 
at the right to a reservoir at $T_2$ (maybe a tank of water-ice mixture 
at 0°C) and at the left to a reservoir at $T_1$ (say a water-steam mixture 
at 100°C). We know heat is going to flow from the hot end to the cold end. 
We want to write a formula for how much heat flows per second. What 
will it be proportional to? Cross section is one correct answer. We can 
understand this result as follows. You take one rod, and for convenience let’s 
just take it to be of rectangular cross section. Take another identical rod, 
and it will also transfer the same amount of heat in a given amount of 
time. Just glue them together to form a rod of twice the cross section. It’s 
going to transmit twice the amount of heat, and it has twice the area. So, 
the rate of heat flow is going to be proportional to the cross-sectional area 
A. And why is the heat flowing? It’s flowing because of a temperature 
difference $\Delta T$. So, that’s always there; that’s the underlying cause 
behind heat transfer. That’s the dynamics in thermodynamics; that’s what makes 
the heat flow. But then, we find as an empirical fact that the rate of heat 
flow decreases as the distance between the reservoirs, the length of the rod, 
increases. Heat flow seems to depend not just on the temperature difference 
but on its gradient in space. So, you want to divide by L, the length of 
the rod separating the hot and cold ends. These features of heat transfer 
happen to be truly independent of what material we are talking about. So 
far we have 


$$\frac{dQ}{dt} \propto A\,\frac{T_1 - T_2}{L}. \tag{21.9}$$


Again, having put all these universal factors, you have to ask, “How 
does the formula differentiate between a copper rod and a wooden rod?” 
The answer again is that you have to put one more number $\kappa$, the thermal 
conductivity, which depends on the material: 


$$\frac{dQ}{dt} = -\kappa A\,\frac{T_2 - T_1}{L}. \tag{21.10}$$

The minus sign in front says that if T decreases with increasing x (that is, it 
gets colder as we go to the right, so that $T_2 < T_1$), the heat current flows 
along the positive x-direction. 

Let us raise the same question once more with feeling: “I have two 
reservoirs, hot and cold. I connected them with two different rods. This rod 
carried twice the amount of heat per second as the other rod. Is it necessar- 
ily a better conductor?” No. Maybe it had 10,000 times the cross section. 
So, what you want to do is to make the playing field level: compare rods 
of the same cross section, same temperature difference, and same length, 
and then ask which conducts more heat. That depends on the material and 
that’s the role of $\kappa$. 
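
Leveling the playing field amounts to solving the conduction formula for the conductivity. In the Python sketch below, the function names and all the numbers are hypothetical and the units are arbitrary; only the ratios matter.

```python
def heat_current(kappa, area, t_hot, t_cold, length):
    """Heat per second flowing from the hot end to the cold end:
    kappa * A * (T_hot - T_cold) / L."""
    return kappa * area * (t_hot - t_cold) / length


def conductivity_from_current(current, area, t_hot, t_cold, length):
    """Solve the same formula for kappa, given a measured heat current."""
    return current * length / (area * (t_hot - t_cold))


# Both rods join the same reservoirs and have the same length (assumed values).
T_HOT, T_COLD, LENGTH = 100.0, 0.0, 1.0

area_a, current_a = 1.0, 1.0                          # first rod
area_b, current_b = 10_000.0 * area_a, 2.0 * current_a  # second rod

kappa_a = conductivity_from_current(current_a, area_a, T_HOT, T_COLD, LENGTH)
kappa_b = conductivity_from_current(current_b, area_b, T_HOT, T_COLD, LENGTH)

print(kappa_b / kappa_a)  # far below 1: the second rod is the worse conductor
```

The second rod carries twice the heat per second only because it has 10,000 times the cross section; per unit area it conducts 5,000 times less well.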


21.7 Heat as molecular kinetic energy 


In the old days, people just said that heat was a caloric fluid, and they 
postulated the conservation law for the fluid. You can postulate what you 
want, but you have to make sure it works. The conservation of calories did 
seem to work, in the sense that all the ΔQs in any reaction did seem to
add up to zero. But then people started getting hints that this fluid was not 
conserved and that this thing we call heat is not entirely independent of 
other things we have studied. 

So, where do you get the clue? When we studied mechanics, we talked 
about two cars in a totally inelastic collision—they slam into one big static 
lump; in the end we have no kinetic energy and no potential energy. We 
just gave up and said, “Look, conservation of energy does not apply to 
this inelastic collision.” On the other hand, we find that in such a collision 
the bodies become hot. So calories are not conserved either: they are created
in a collision. There are more clues. You take cannon balls and drop
them from a big tower. When they hit the sand, they start heating up. Or 
you drill a hole in a cannon to guide the ball, which is what Count Rumford 
did in 1790, and you find that you need to constantly pour water to keep 
the drill bit from heating up. You find all the time that, when mechanical 
energy is lost, things heat up, that is, calories appear from nowhere. So, 
you get a suspicion that maybe there’s a rule connecting the mechanical 
energy that you cannot account for, that seems lost, to a corresponding 
gain in calories. Maybe the conservation of mechanical energy and of heat 
could both be salvaged into a single law of conservation of energy if joules 
measure energy you can see, and calories energy you cannot see. That is 
the premise. If that is the case, we must first determine the exchange rate 
between calories and joules. 

James Prescott Joule performed the experiment shown in Figure 21.7. 
You have an insulated container in which there is water and a shaft attached 
to fins. The shaft can rotate. At its top is a pulley with a weight hanging as 
shown. There is a rope wrapped around the shaft, and when you let this 
weight go down, it’s going to spin the shaft. The fins churn up the water. 
Now, you can keep track of how much mechanical energy is lost, right? 
This mass was at rest at first and it lost potential energy Mgh as it came 
down. Let’s say it’s got some kinetic energy at the end that is less than
Mgh. Some number of joules are gone but the water heats up. When the
water gets hot, you can ask how many calories were supplied to the water
by looking at the mass and specific heat of the water and the increase in
temperature. Then you ask if there is a proportionality between the joules
and calories. You find it is 4.2 joules to a calorie, called the mechanical
equivalent of heat. In other words, if you can expend 4.2 joules of mechanical
energy, you get one calorie to be used for heating. In the example of the
colliding cars, one car had some kinetic energy, and the other had some kinetic
energy, all measured in joules; they slammed together, and they came
to rest. That means you can take all those joules, divide by 4.2, and get
some number of calories ΔQ. Then those calories will produce an increase
in temperature of the cars, such that ΔQ = mcΔT, where c is the specific heat
of whatever material the cars were made of.

Figure 21.7 Joule’s experiment to find the mechanical equivalent of heat,
4.2 kJ/kcal. As the weight descends, the fins turn and heat up the water. The
missing mechanical energy in the weight becomes heat energy. (It is assumed that
the rest of the apparatus does not absorb any appreciable part of the generated
heat.)

In practice, there will be other losses. You heard the crash: that’s some 
sound energy gone; you won't get it back. Some sparks were flying; that’s 
light energy that’s gone. You subtract all that out, and you find that in the 
end, the calories explain the missing joules. 
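
The bookkeeping for the colliding cars can be sketched in a few lines. The masses, speeds, and the specific heat of iron below are hypothetical numbers I am assuming for illustration; only the exchange rate of 4.2 joules per calorie comes from the text, and sound and light losses are ignored.

```python
JOULES_PER_CALORIE = 4.2  # mechanical equivalent of heat quoted in the text


def temperature_rise(lost_joules, mass_grams, specific_heat):
    """Temperature rise (deg C) when lost_joules of mechanical energy become heat.

    specific_heat is in cal/(g*degC), so that delta_Q = m * c * delta_T holds
    with delta_Q in calories.
    """
    delta_q_calories = lost_joules / JOULES_PER_CALORIE
    return delta_q_calories / (mass_grams * specific_heat)


# Hypothetical collision: two 1000 kg cars, each at 20 m/s, collide head-on and stop.
mass_kg, speed = 1000.0, 20.0
lost = 2 * 0.5 * mass_kg * speed**2      # total kinetic energy lost, in joules
c_iron = 0.11                            # cal/(g*degC), approximate value assumed for iron
delta_t = temperature_rise(lost, 2 * mass_kg * 1000.0, c_iron)
print(f"energy lost: {lost:.3g} J, temperature rise: {delta_t:.2f} deg C")
```

For these assumed numbers the rise is well under one degree, which is one reason nobody noticed the missing joules as heat until careful experiments like Joule’s were done.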

Counting heat as another form of energy, we can save the law of con- 
servation of energy. It is not violated, even during inelastic collisions, if 
you include heat as a form of energy using 4.2 joules per calorie. What 
right do we have to call heat energy? When we say some kid is energetic, 
we mean the kid is always running around mindlessly, back and forth. En- 
ergy is associated with motion. These two cars were moving, and we have 
every right to say they have kinetic energy. How about potential energy 
then? Well, if the car starts climbing up a hill and slows down, we think 
it has potential. If you let it go, it will come back and give you the kinetic 
energy. So, most people think of energy as just kinetic energy. That is what 
seems to have been lost in the collision. And yet, you get calories in return, 
so you ask yourself, “How is the calorie related to kinetic energy?” 

The correct answer to that came only when we understood that ev- 
erything is made of atoms. It turns out that the kinetic energy of atoms is 
what we call heat. But you have to be very careful. I take a cylinder full of 
very cold gas. I throw it at you. That whole cylinder is moving. That’s not 
what I call heat; the cylinder is not hotter due to its overall motion. That is 
motion you can see. I’m talking about a cylinder of gas that doesn’t seem 
to be going anywhere; yet it has motional energy because the little guys 
inside are going back and forth. If you kept track of the kinetic energy of 
every single molecule in this car, every single molecule in that car, and you 
added them up, you would get exactly the same number, before and after. 
The only difference will be that, originally, each car also had some global
common velocity, macroscopic velocity you could see. On top of it, it had 
random motion of the molecules that made up the car. So did the other 
car. When they slammed together, the macroscopic motion was traded for 
thermal motion. 


CHAPTER 22 
22.1 Recap


In the last chapter we took the notion of temperature, for which we have 
an intuitive feeling, and turned it into something more quantitative, so you 
can not only say this is hotter than that, you can say by how much, by how 
many degrees. We agreed to use the absolute Kelvin scale for temperature 
and to use the product PV of a gas thermometer as a measure of temperature
T. The Kelvin scale has its origin (T = 0) at −273.15°C, which
is where PV vanishes for any dilute gas. In other words, it appears that
pressure times volume is some constant times this temperature T. That’s 
the scale chosen by nature, and it doesn’t seem to depend on the gas that 
you use. I can use one; you can use another one. People on another planet 
can use a different gas. There is really nothing below this T = 0. 

The next thing I mentioned was that initially people believed in the 
caloric fluid. Hot things have a lot of it, and cold things have less of it, and 
the caloric flows from hot to cold. Caloric is conserved, which allows you 
to do some problems in calorimetry. That promoted heat to a new and 
independent entity, different from all other things. 

Then it turned out that heat was not so unrelated to other things 
in mechanics. You can heat up water by putting it on the stove, and the 
caloric fluid flows from the stove into the water. But there’s a different 
way to heat the water. This is not the most economical way to make your 
coffee but I’m just telling you as a matter of principle. Buy two Ferraris,
slam them into each other, take a pot of water, and put it on top of the
cars. It will heat up because the Ferraris will be hot. What happened to the 
kinetic energy of the two cars? It is really gone. In the old days, we would 
have said that we cannot apply the law of conservation of energy to this in- 
elastic collision. That was our way out of the whole issue. But then we had 
another problem: caloric fluid was not conserved, because slamming the 
two cars produced this extra heat. So both the law of conservation of en- 
ergy and the law of conservation of the caloric fluid seemed to be violated. 
Luckily both could be saved if heat was counted as a form of energy. When me- 
chanical energy disappears, a definite amount of heat energy appears. How 
many calories can you get if you sacrifice one joule of mechanical energy? 
Joule did the experiment with his gadget, which had a shaft with some fins 
that turned in some water as some weight went down and heated the wa- 
ter. Equating the loss of mechanical energy to the calories needed for the 
heating, he found the mechanical equivalent of heat: 4.2 joules = 1 calorie. 


22.2 Boltzmann’s constant and Avogadro’s number 


When we say something is hotter, what do we mean on a microscopic level? 
The answer is based on the profound fact that everything is made up of 
atoms. 

Take the simple example where temperature enters: the gas ther- 
mometers from the last chapter. The linearity of the PV plots as a function 
of T means that PV is proportional to the absolute temperature T. What 
parameters do you think enter the right-hand side of the proportionality 
relation 


$$PV \propto T\,? \qquad (22.1)$$


The amount of gas is correct. The amount of atoms is even more correct. 
But suppose you were not aware of atoms. Then what would you mean 
by “amount of gas”? It would have to be the mass. What’s the reasoning? 
We know that if you have some amount of gas producing the pressure in a 
box, and you put in twice as much stuff, it should produce twice as much 
pressure. That’s actually correct. We can write for one particular gas sample 
of mass m: 


$$PV \propto mT. \qquad (22.2)$$

As usual, after these generic factors are incorporated, you still need a
constant of proportionality α that is specific to the gas. (It will be renamed
before you get a chance to confuse it with the coefficient of linear
expansion.) So now we write


$$PV = \alpha m T. \qquad (22.3)$$


Experimentally we find that α depends on the gas, not unlike the specific
heats of various substances. But there is an intriguing relation between
the α’s for various species and the α_H for hydrogen. For this discussion, and
to grasp the main point, let us pretend that there are only three atomic species,
hydrogen, helium, and carbon, and that as gases they always occur in atomic
and not molecular form. It is found that for helium α_He = (1/4)α_H and for
carbon α_C = (1/12)α_H. In other words, one gram of helium produces the same
pressure as 1/4 gram of hydrogen, and one gram of carbon produces the
same pressure as 1/12 gram of hydrogen (at some fixed V).

If m_H, m_He, and m_C denote the masses of H, He, and C, the preceding
results may be summarized as follows:


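$$
\begin{aligned}
PV &= \alpha_H m_H T = \alpha_H \left[\frac{m_H}{1}\right] T && (22.4)\\
   &= \alpha_{He}\, m_{He} T = \frac{\alpha_H}{4}\, m_{He} T = \alpha_H \left[\frac{m_{He}}{4}\right] T && (22.5)\\
   &= \alpha_C\, m_C T = \frac{\alpha_H}{12}\, m_C T = \alpha_H \left[\frac{m_C}{12}\right] T \quad \text{and so on.} && (22.6)
\end{aligned}
$$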


So it looks like the mass of the gas has to be divided by numbers like 
4 or 12 to find its true effectiveness in contributing to pressure and that 
if these rescaled masses are used, the same proportionality constant α_H may
be used for all gases. What do these rescaled masses mean and why are the 
rescaling factors nice round numbers? Here is the answer, with no details 
of how it was arrived at, because that would take way too long. 


• All matter, including gases, is made of atoms (and also molecules, but I
will refer to them all as atoms).
• The masses of hydrogen, helium, and carbon atoms are in the ratio


$$\mathrm{H} : \mathrm{He} : \mathrm{C} = 1 : 4 : 12. \qquad (22.7)$$

Thus in Eqns. 22.4–22.6, the mass of each gas is really being divided by
a number proportional to the mass of the constituent atoms. The result 
is clearly proportional to the number of atoms in each sample. In other 
words, PV is proportional to the product of T and N, the number of 
atoms in the sample with a constant of proportionality that is independent 
of the gas. 

Thus we may write 


PV = kNT (22.8) 


where this new constant k = 1.38 × 10⁻²³ J/K is independent of the gas and
is called Boltzmann’s constant. 

Bear in mind that we are discussing an ideal gas, one whose atoms 
move independently of each other with no interatomic force. Gases tend to 
become ideal at low densities, when the typical separation between atoms 
is large and the interatomic forces negligible. 

The number N of gas atoms in a typical sample is huge. In particular, 
one gram of the lightest atom, hydrogen, has N_A = 6 × 10²³ atoms, where
N_A is called Avogadro’s number. (So, N_A is the reciprocal of the mass of
a hydrogen atom in grams.) It’s simply a number, like a dozen, and it is 
also called a mole. Sometimes the word mole is also used to stand for the 
amount of any substance that has a mole of atoms in it. Thus a mole of 
carbon has a mass of 12 grams. It is a natural unit to use for counting 
atoms, the way a dozen is a natural unit for eggs, a light year is a natural 
unit for cosmological distances, and a kilogram is a natural unit for the 
mass of humans. 
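
Here is a minimal sketch of the counting argument: once you divide each mass by the relative atomic mass, equal numbers of atoms give equal pressures. The volume and temperature are illustrative values I am assuming, the gases are treated as monatomic as in the text, and the function names are just labels for this example.

```python
AVOGADRO = 6.0e23      # atoms in one gram of hydrogen, as quoted in the text
BOLTZMANN = 1.38e-23   # J/K


def number_of_atoms(mass_grams, relative_atomic_mass):
    """Atoms in a sample, using atomic masses relative to hydrogen (H = 1)."""
    return AVOGADRO * mass_grams / relative_atomic_mass


def pressure(mass_grams, relative_atomic_mass, volume_m3, temperature_k):
    """Ideal-gas pressure in pascals, from PV = kNT."""
    n_atoms = number_of_atoms(mass_grams, relative_atomic_mass)
    return BOLTZMANN * n_atoms * temperature_k / volume_m3


# 1 g of H, 4 g of He, and 12 g of C: one mole of atoms in each case.
volume, temperature = 0.0224, 273.0   # assumed values of V (m^3) and T (K)
samples = {"H": (1.0, 1), "He": (4.0, 4), "C": (12.0, 12)}
pressures = {name: pressure(m, a, volume, temperature) for name, (m, a) in samples.items()}
for name, p in pressures.items():
    print(f"{name}: {p:.4g} Pa")
```

The three pressures agree because the three samples contain the same number of atoms; the mass by itself is not the right measure of the amount of gas.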

In terms of n, the number of moles defined by 


$$n = \frac{N}{N_A}, \qquad (22.9)$$


the equation of state, the relation between P, V, and T for the ideal gas 
becomes 


$$
\begin{aligned}
PV &= NkT = nN_A kT && (22.10)\\
   &= nRT, && (22.11)
\end{aligned}
$$

where

$$R = N_A k = 8.31\ \frac{\mathrm{J}}{\mathrm{K \cdot mole}} \qquad (22.12)$$

is the universal gas constant, equal to about 2 cal/(°C·mole), easy
to remember and the same for all gases. Initially R was what was measured 
and people worked with the moles n. Then they looked under the hood 
and found that the gas is made of individual atoms, and they switched 
from the macroscopic parameters n and R to the microscopic N and k. 
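
As a quick check of the numbers, this sketch multiplies the two microscopic constants to recover R and converts it to calories with the exchange rate from the last chapter. The slightly more precise value 6.02 × 10²³ for Avogadro’s number is my assumption; the text rounds it to 6 × 10²³.

```python
BOLTZMANN = 1.38e-23        # J/K, from the text
AVOGADRO = 6.02e23          # per mole; the text rounds this to 6e23
JOULES_PER_CALORIE = 4.2    # mechanical equivalent of heat from Chapter 21

gas_constant = AVOGADRO * BOLTZMANN                    # J/(K*mole)
gas_constant_cal = gas_constant / JOULES_PER_CALORIE   # cal/(K*mole)
print(f"R = {gas_constant:.2f} J/(K*mole) = {gas_constant_cal:.2f} cal/(K*mole)")

# The same state described macroscopically (n, R) and microscopically (N, k).
n_moles, temperature = 2.0, 300.0
pv_macro = n_moles * gas_constant * temperature
pv_micro = (n_moles * AVOGADRO) * BOLTZMANN * temperature
print(f"PV from nRT: {pv_macro:.1f} J, PV from NkT: {pv_micro:.1f} J")
```

The two descriptions give the same PV, as they must: n and R are just N and k regrouped by the factor N_A.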


22.3 Microscopic definition of absolute temperature 


Look at PV = NkT = nRT. Is there a microscopic basis for this equation? 
In other words, once we believe in atoms, do we understand why there is a 
pressure at all in a gas? For this purpose, we will take an L × L × L cube of
gas as shown in Figure 22.1. Inside this is some gas and it has some pres- 
sure, and I want to know the value of the pressure. Consider the shaded 
face of the cube. It has to be nailed down to the other faces; otherwise, 
it’ll just come flying out because the gas is pushing it out. The pressure
is the force on this face divided by area. The atoms are constantly bounc- 
ing off the wall, and every time one bounces on a wall, its momentum 
changes. So, who’s changing the momentum? Well, the wall is changing 
the momentum. It’s reversing it, for example, if the atom bounces head-on 
and goes back, as in the figure. That means the atom pushes the wall with 
some force, and the wall pushes back with the opposite force. It’s the force
that the atom exerts on the wall that I’m interested in. I want to find the
force on this particular face and divide by its area to find the pressure. You
can find the pressure on any face; it is going to be the same in equilibrium.

Figure 22.1 A cubic box with N gas atoms inside. The figure shows an atom of
velocity v hitting the shaded wall and reversing its velocity. The corresponding
momentum change is due to the wall, which exerts a force on the atom. The
opposite force the atom exerts on the wall is its contribution to the pressure.

I have N atoms, randomly moving inside the box, each suffering collisions
with the walls, bouncing off like a billiard ball would at the walls of
the pool table, and going to another wall and doing it again. Now, that’s 
a very complicated problem, so we’re going to simplify it. We are going to 
assume that one-third of the atoms are moving from left to right or right 
to left (perpendicular to the shaded face), one-third are moving up and 
down, and one-third are moving in and out of the page. Of course, you 
will have to assign equal numbers to these directions, for nothing in the 
gas favors any one. So (1/3)N atoms are going back and forth between this
shaded wall and the one opposite to it. The figure shows you a side view. 
The force on an atom is the rate of change of momentum: 


$$F = ma = m\frac{dv}{dt} = \frac{d(mv)}{dt} = \frac{dp}{dt}. \qquad (22.13)$$


Next we assume all the atoms have the same speed, which I’m going to call 
v. Take one particular atom that hits the shaded wall and bounces back. Its 
momentum changes from mv to −mv; therefore, the magnitude of the change in
momentum is 2mv. How often does that collision take place? Once it hits that wall,
it has to go to the other wall and come back. It has to go a distance 2L, at
a speed v. The time it takes is 2L/v and the frequency of collisions with the
shaded wall is v/(2L). So


$$
\begin{aligned}
\frac{dp}{dt} &= \text{momentum change per collision} \times \text{collisions per second}\\
&= 2mv \cdot \frac{v}{2L}. \qquad (22.14)
\end{aligned}
$$


That is the force due to one atom. It’s not a continuous force. The atom 
hits the wall, there’s a little force exchange between the two, then there’s 
nothing, and then you wait until it comes back and hits the wall again. 
If that were the only thing going on, most of the time there would be no 
pressure, and suddenly there would be a lot of pressure, and then nothing. 
But, fortunately, this is not the only atom. There are roughly 10²³ guys
pounding against the wall. In a short time, even a tiny fraction of a second, there will
be a large number of atoms hitting the wall. The force will appear to be 
steady rather than staccato. The average force due to all the atoms hitting
this wall is

$$\bar{F} = \frac{1}{3} N \frac{mv^2}{L}. \qquad (22.15)$$

The other (2/3)N atoms are moving parallel to this wall and apply no force
on it.
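
The staccato force from a single atom can be sketched as follows: count the hits on one wall during some time window and divide the total momentum transferred by the duration. The mass, speed, and box size are illustrative assumptions of mine, and the starting position of the atom (at the opposite wall) is an arbitrary choice.

```python
def average_force_one_atom(mass, speed, box_length, duration):
    """Time-averaged force on one wall from a single atom bouncing between two walls.

    The atom starts at the opposite wall, so it first hits the wall of interest
    after a time L/v and then once every 2L/v. Each hit transfers momentum 2mv.
    """
    period = 2.0 * box_length / speed
    first_hit = box_length / speed
    if duration < first_hit:
        return 0.0
    hits = 1 + int((duration - first_hit) // period)
    return hits * 2.0 * mass * speed / duration


# Assumed numbers: a hydrogen-like atom in a 10 cm box.
m, v, L = 1.7e-27, 2000.0, 0.1       # kg, m/s, m
steady = m * v**2 / L                # the value 2mv * v/(2L) from Eqn. 22.14
for t in (0.6e-4, 3.3e-4, 1.0e-2, 10.0):
    ratio = average_force_one_atom(m, v, L, t) / steady
    print(f"averaged over {t:g} s: {ratio:.3f} times mv^2/L")
```

Over windows comparable to the round-trip time the average jumps around, but over a long window it settles to mv²/L. With an enormous number of atoms hitting at unrelated times, the same smoothing happens even over very short windows.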
We’re almost done. What about the average pressure? It is the average
force divided by the area of that face:

$$P = \frac{1}{3} N \frac{mv^2}{L \cdot L^2}. \qquad (22.16)$$

This is very nice because L³ is just the volume of my box. I send it to
the other side and find 


$$PV = \frac{1}{3} N m v^2. \qquad (22.17)$$


This is what the microscopic theory tells you: if your atoms all have 
a single speed, if they're moving randomly in space so that a third of them 
are moving back and forth against that shaded wall and the opposite wall, 
then PV = (1/3)Nmv². Experimentally, you find PV = NkT. So, you compare
the two expressions and out comes one of the most beautiful results: 


$$\frac{mv^2}{2} = \frac{3}{2} kT. \qquad (22.18)$$

This profound formula not only confirms the kinetic theory, that 
there are atoms, but it also gives for the first time the microscopic mean- 
ing of temperature. What you and I call the absolute temperature of a gas 
is, up to the factor (3/2)k, simply the kinetic energy of the atoms. If you put
your hand into a container with some gas and it feels hot, the temperature
you’re measuring is the kinetic energy of the atoms. We see why absolute
zero is absolute. As you cool your gas, the kinetic energy of atoms
steadily decreases, but you cannot go below not moving at all, right? That’s 
the lowest possible kinetic energy. That’s why it’s absolute zero. At that 
point, everybody stops moving. That’s why you have no pressure. Now, 
these results are modified by the laws of quantum mechanics, but we don’t 
have to worry about that now. In classical physics, it’s correct to say that
when the temperature goes to absolute zero, all motion ceases. 

So, bear in mind that absolute temperature is a measure of atomic ki- 
netic energy for a gas. Every ideal gas, whatever it’s made of, has the same 
kinetic energy per atom at a given temperature. The kinetic energy will be 
the same, but not the velocity. As the carbon atom is heavier than hydro- 
gen, it will be moving slower at that temperature in order to have the same 
kinetic energy. 
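
To put numbers on this, here is a small sketch that takes Eqn. 22.18 literally and computes the speed for three kinds of atoms at the same temperature. The temperature of 300 K is an arbitrary choice, and the atomic masses use the rounded ratios 1 : 4 : 12 and the rounded Avogadro number from the text.

```python
import math

BOLTZMANN = 1.38e-23   # J/K
AVOGADRO = 6.0e23      # atoms per gram of hydrogen, as in the text


def atom_mass_kg(relative_atomic_mass):
    """Mass of one atom in kg, taking hydrogen as 1 gram per AVOGADRO atoms."""
    return relative_atomic_mass * 1.0e-3 / AVOGADRO


def kinetic_energy(temperature_k):
    """Kinetic energy per atom, (3/2) k T, in joules."""
    return 1.5 * BOLTZMANN * temperature_k


def speed(relative_atomic_mass, temperature_k):
    """Speed v that satisfies (1/2) m v^2 = (3/2) k T, in m/s."""
    return math.sqrt(3.0 * BOLTZMANN * temperature_k / atom_mass_kg(relative_atomic_mass))


T = 300.0
for name, a in (("H", 1), ("He", 4), ("C", 12)):
    print(f"{name}: KE = {kinetic_energy(T):.3g} J, v = {speed(a, T):.0f} m/s")
```

The kinetic energy column is the same for all three, while the speed falls like one over the square root of the mass: helium moves at half the speed of hydrogen.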


22.4 Statistical properties of matter and radiation 


In a gas, the atoms are moving anywhere they want in the box. In a solid, 
every atom has a place. If you take a two-dimensional solid, the atoms form 
a nice array; for example, a square grid or lattice. They are not absolutely 
locked to the points on the grid; instead, they execute simple harmonic 
motion about the lattice points. They experience a potential that looks like 
an egg carton. The minima of the potential are at the lattice points. If an 
atom moves off the minimum in any direction, there is a restoring force 
to bring it back to the minimum. At T = 0 all the atoms will sit at these 
minima. If there is an atom here, at this minimum, I know that if I go 100 
times the lattice spacing in the x or y direction (for a square lattice), there 
will be another atom sitting there. That’s called long-range order. If you 
heat up that solid, the atoms start vibrating. If you put the solid on top of 
a hot plate, the atoms in the hot plate will bump into these atoms and start 
them vibrating. Their average locations still exhibit long-range order. But, 
in a hot solid, the atoms are making violent oscillations around their as- 
signed positions. If you energize them more and more, there is nothing to 
prevent them from rolling over to the next minimum. Once that happens, 
all hell breaks loose, because they don’t have any reason to stop there. They 
start going everywhere. That’s what we call melting. 

A liquid is more subtle. If you look at a liquid locally, the interatomic 
spacing is very tightly constrained. Locally, the environment around an 
atom is known, but if you go a short distance, I cannot give you a pre- 
cise location where you may find another atom. So, we say a liquid has 
short-range positional order, but not long-range order. An ideal gas has no 
order at all. If I tell you there’s a gas atom here, I cannot tell you where 
anybody else is because nobody has any assigned location in relation to 
others. 

Going back to the gas, of course, a third of the atoms are not all moving
in one direction. They’re moving in random directions. They are also
not moving at a fixed speed v. If I give you a gas at 300 K and you take
this formula Eqn. 22.18 literally and calculate from it a certain v, you will 
not find every atom at that velocity. Not only are the atoms moving in ran- 
dom directions, they are also moving with essentially all possible velocities. 
The velocity you are getting from this formula is some kind of average. If 
you have the ability to see the gas atoms and measure each velocity, what is 
the probability P(v) that you get a magnitude v? The answer for a typical 
case is shown in Figure 22.2 and has the form 


$$P(v) \propto v^2 e^{-mv^2/2kT}. \qquad (22.19)$$


It has a certain peak, a most probable velocity (actually speed). 
The average kinetic energy will obey (1/2)m⟨v²⟩ = (3/2)kT. The distribution in
Eqn. 22.19 is called the
Maxwell-Boltzmann velocity distribution, though it should be called the
speed distribution. This is the detailed description of what’s happening in 
a gas. A given temperature does not pick a unique velocity, but it picks 
a unique graph for P(v) parametrized by T. You will understand this 
distribution better upon reading the end of Chapter 24. 
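
If you want to see such a distribution emerge, here is a minimal sketch that samples speeds and checks two of its features. It rests on an assumption not derived in this chapter: that each velocity component is Gaussian with zero mean and variance kT/m, which is the standard route to Eqn. 22.19. The atom mass, sample size, and bin width are arbitrary choices.

```python
import math
import random

BOLTZMANN = 1.38e-23  # J/K


def sample_speeds(mass, temperature, count, seed=0):
    """Draw atomic speeds by giving each velocity component a Gaussian spread.

    Assumption (standard kinetic theory, not derived here): v_x, v_y, v_z are
    independent Gaussians with zero mean and variance kT/m, which leads to
    P(v) proportional to v^2 exp(-m v^2 / 2kT) for the speed.
    """
    rng = random.Random(seed)
    sigma = math.sqrt(BOLTZMANN * temperature / mass)
    return [
        math.sqrt(sum(rng.gauss(0.0, sigma) ** 2 for _ in range(3)))
        for _ in range(count)
    ]


mass, temperature = 6.7e-27, 300.0        # roughly a helium atom, at 300 K
speeds = sample_speeds(mass, temperature, 100_000)

# Average kinetic energy compared with (3/2) k T; the ratio should be close to 1.
mean_ke = sum(0.5 * mass * v * v for v in speeds) / len(speeds)
ratio = mean_ke / (1.5 * BOLTZMANN * temperature)

# Crude histogram: the most populated bin should sit near sqrt(2kT/m).
bin_width = 100.0  # m/s
counts = {}
for v in speeds:
    index = int(v // bin_width)
    counts[index] = counts.get(index, 0) + 1
peak_speed = (max(counts, key=counts.get) + 0.5) * bin_width
expected_peak = math.sqrt(2.0 * BOLTZMANN * temperature / mass)

print(f"mean KE / (3/2 kT) = {ratio:.3f}")
print(f"histogram peak near {peak_speed:.0f} m/s, expected about {expected_peak:.0f} m/s")
```

No single atom is obliged to have the speed given by Eqn. 22.18; it is the average kinetic energy over the whole sample that is pinned down by the temperature.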

Figure 22.2 The probability P(v) ∝ v² e^{−mv²/2kT} that an atom in a gas will have a
speed v.

Here’s a digression. Take a box containing just radiation; in other
words, go inside a pizza oven. Take out all the air; the oven is still hot,
and the walls of the oven are emitting electromagnetic radiation. Electromagnetic
radiation comes in different frequencies, and you can ask how
much energy is contained in every possible frequency range. You know 
each frequency is a color if it is in the visible range. So, how much energy 
is in the red and how much is in the blue? That graph P(ω) as a function
of ω also looks like the Maxwell-Boltzmann distribution and is called the
Planck distribution. Whereas for atoms, P(v) is determined by temperature,
the Boltzmann constant k, and the mass of the molecules, P(ω) is
determined by temperature, the Boltzmann constant, the velocity of light,
and Planck’s constant h ~ 10⁻³⁴ J·s. You give me a temperature, and I will
draw you another one of these roughly bell-shaped curves. As you heat 
up the furnace, the shape will change, shifting to the right, toward higher 
frequencies. 

In summary, a temperature for electromagnetic radiation implies a 
particular distribution of energy at each frequency, while for a gas it means 
a distribution of atomic velocities. 

One prediction of the Big Bang theory is that the universe was formed 
roughly 13.8 billion years ago. In the earliest stages the temperature of the 
universe was incredibly high, and then as the universe expanded it cooled. 
Today, at the current size, it has a certain average temperature, which is a 
remnant of the Big Bang. And that temperature means that we are sitting 
in the furnace of the Big Bang. But the furnace has cooled a lot over the 
billions of years. The temperature of the universe is around 3K. You de- 
termine that by pointing your telescope at the sky. Of course, you're going 
to get light from this star and that star. Ignore all the pointy things and 
look at the smooth background, and it should be the same in all directions.
Plot that radiation as a function of ω, and you’ll get a perfect fit to
the Planck distribution with T ≈ 3 K. If you go to intergalactic space, that
is your temperature. We’re all living in that heat bath at 3 degrees, and it is
getting colder as the universe expands. 


22.5 Thermodynamic processes 


Now we are going to study thermodynamics in detail. There is only one 
system we will study, which is an ideal gas sitting inside a cylinder as shown 
in Figure 22.3. It has a pressure P₁ and a volume V₁. I’m going to put
a dot in the P-V plane at (P₁, V₁), and that’s my gas. The state of my
gas is summarized by where I put the dot. Every dot here is a possible
state of equilibrium for the gas. Remember, if you look under the hood,
the gas is made up of roughly 10²³ atoms. The microscopic state of the
gas is obtained by giving 10²³ locations and 10²³ velocities. According to
Newton, that’s the maximum information you can give me about the gas
right now. With that and Newton’s laws, I can predict the future state. But
when you study thermodynamics, you don’t really want to look into the
details. You want to look at gross macroscopic properties, and there are
two that you need: pressure and volume. Now, you might say, “What about
temperature?” Why don’t I have a third axis for temperature? Why is that
not a property? Because PV = NkT. I don’t have to give you T, if I give
you P and V. By the way, PV = NkT only applies to ideal gases, whose
atoms and molecules are so far apart that they don’t feel any forces between
each other unless they collide. In general PV = NkT will be replaced by an
equation of state, which is in general a complicated relation between P, V,
and T. But we are going to study only dilute ideal gases.

Figure 22.3 Left: A piston and cylinder combination. The weights balance the
pressure P of the gas inside. (The atmospheric pressure is assumed to be zero or
added to that of the weights.) The volume V is just the volume below the piston.
Right: A point (P₁, V₁) representing a state of the gas and a possible quasi-static
path in the P-V plane to state (P₂, V₂). To move quasi-statically, the weights
must be replaced by fine grains of sand, which may be added or removed.

Back to the gas in the cylinder. It had three weights on top of the 
piston. I suddenly pull out one weight. What do you think will happen? 
The piston will now shoot up and bob up and down a few times. Then, 
after a fraction of a second, it will settle down at a new location (P2, V2). By 
“settle down,” I mean after a while I will not see any macroscopic motion.
Then the gas has a new pressure and a new volume. It’s gone from being 
at 1 to being at 2. What happened in between the starting and finishing 
points? You might say, “Look, if it was here in the beginning, and if it was 
there later, it must’ve followed some path.” Not really. Not in this process, 
because if you do it very abruptly, suddenly throwing out one-third of the 
weights, there’s a period when the piston rushes up, when the gas is not 
in equilibrium. There is no single pressure you can associate with the gas. 
The bottom of the gas doesn’t even know the top is flying off. It’s at the 
old pressure. At the top the gas is at a lower pressure. We don’t call that 
equilibrium. So, the dot, representing this system, moves off the graph. It’s 
off the radar, and only when it has finally settled down, when the entire 
gas has made up its mind on what its pressure is, can you represent it once 
again as a point (P2, V2). 


22.6 Quasi-static processes 


Now we have a little problem. We have these equilibrium states, but when 
you try to go from one to another the system flies off the P-V plane.
So, you want to find the means by which you can stay on the P-V plane as
you change the state of the gas. That brings us to the notion of a quasi-static
process. A quasi-static process is an attempt to have it both ways: you want
to change the state of the gas, and you don’t want it to leave the
P-V diagram. You want it to be always infinitesimally close to equilibrium.
So, what you really want is not three big fat weights on the piston, but 
many, many tiny grains of sand that produce the pressure. Now, remove 
one grain of sand. The piston moves a tiny bit and very quickly settles 
down. It is again true that during the tiny bit of settling down you didn’t 
know what it was doing, but you certainly nailed it at the second location 
very close to the first. You remove one grain at a time and make the grains 
smaller and smaller, and you wait longer and longer between these changes 
to permit equilibration. Then, in a mathematical sense, you will be able 
to connect the dots representing equilibrium states to form a continuous 
line. That is a quasi-static process. Our discussions are not totally academic 
because many processes at real speeds are approximately quasi-static. For 
example, the internal combustion engine in your car completes thousands 
of cycles per minute, and yet at each instant the gas inside is close enough 
to equilibrium to be represented in the P-V plane.
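
The grains-of-sand idea can be sketched numerically. The sketch below makes assumptions that go beyond the text: half of the sand is removed in equal steps, the gas is held at a fixed temperature (so PV stays constant), and we record the state only after the gas has settled following each step. The weight, piston area, and value of NkT are arbitrary illustrative numbers.

```python
def expansion_path(total_weight, area, n_kt, grains):
    """Equilibrium states visited as sand is removed from the piston, grain by grain.

    Assumptions for this sketch: half of the sand is removed in `grains` equal
    steps, the temperature is held fixed (PV = NkT = n_kt stays constant), and
    (P, V) is recorded only after the gas has settled following each step.
    """
    states = []
    for step in range(grains + 1):
        weight = total_weight * (1.0 - 0.5 * step / grains)
        pressure = weight / area
        states.append((pressure, n_kt / pressure))
    return states


def largest_jump(states):
    """Largest change in volume between successive recorded equilibrium states."""
    return max(b[1] - a[1] for a, b in zip(states, states[1:]))


for grains in (3, 30, 3000):
    path = expansion_path(total_weight=100.0, area=0.01, n_kt=250.0, grains=grains)
    print(f"{grains:5d} grains: largest jump in V = {largest_jump(path):.3g} m^3")
```

The starting and ending states are the same in every run, but as the grains get finer the gaps between successive dots shrink, and the dots fill in what looks more and more like a continuous curve in the P-V plane.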

The above quasi-static process is also reversible, which means the following.
If, when I took off a grain of sand, the representative point moved
from one dot in the P-V plane to a nearby dot, then, when I put the grain
back, it’ll go back to where it was. So you can go back and forth. But now,
that’s an idealized process. If you have friction, and you take out a grain and 
it goes up, then when you put the grain back, it will not come back to quite 
where it was. You cannot put Humpty Dumpty back. Thus most processes 
are irreversible, even if you do them slowly, due to effects like friction. We 
will assume in our discussion that idealized reversible processes exist. 

In the old days, when we studied a single particle in the x-y plane,
I just said the particle goes from here to there. There was no restriction on 
how fast it moved. Particles had trajectories no matter how quickly they 
moved. As for thermodynamic systems, you cannot move them too fast. 
They are extended and you have a huge number of atoms described by a 
few macroscopic numbers like pressure. You cannot change one part of 
the gas without waiting for the rest of it to respond, readjust, and achieve 
a global value for the new pressure and other thermodynamic variables. 


22.7 The first law of thermodynamics 


Every dot in the P-V plane denotes an equilibrium state. In every state
of the system, I’m going to define a new variable, U, called the internal 
energy of the gas. For the ideal gas, to which we will restrict most of our 
discussions, it is simply the kinetic energy of the gas molecules. (For non- 
ideal gases, solids, and liquids, U is the total energy including potential 
energy.) Thus 


$$U = \frac{3}{2}NkT = \frac{3}{2}nRT = \frac{3}{2}PV \tag{22.20}$$


using PV = NkT = nRT. That means that, at any given point in the P-V 
diagram, you have a certain internal energy. Notice that the internal energy 
of an ideal gas depends only on the temperature. That’s something very, very 
important. If the temperature has not changed, the internal energy has not 
changed. 
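
To put numbers to Eqn. 22.20, here is a minimal Python sketch. The sample values (one mole, 300 K, a volume of 0.0248 cubic meters) and the function names are my own illustrative assumptions, and the gas constant is rounded to R = 8.314 J/(mol K).

```python
# Internal energy of a mono-atomic ideal gas, written two equivalent ways.
R = 8.314  # gas constant in J/(mol K), rounded (assumed value for this sketch)


def internal_energy(n_moles, temperature):
    """U = (3/2) n R T, as in Eqn. 22.20."""
    return 1.5 * n_moles * R * temperature


def internal_energy_from_pv(pressure, volume):
    """The same U written as (3/2) P V, using PV = nRT."""
    return 1.5 * pressure * volume


n, T = 1.0, 300.0          # illustrative: one mole at 300 K
V = 0.0248                 # an arbitrary volume in m^3
P = n * R * T / V          # pressure fixed by the ideal gas law

U_T = internal_energy(n, T)
U_PV = internal_energy_from_pv(P, V)
print(U_T, U_PV)           # both about 3741 J

# Doubling the volume at the same T halves P but leaves U unchanged.
U_other = internal_energy_from_pv(n * R * T / (2 * V), 2 * V)
print(U_other)
```

The last two lines make the point of the paragraph above: as long as T is the same, it does not matter where on the isotherm you sit.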

Note the change in notation: in mechanics E stood for the total en- 
ergy and U for the potential energy. Now U stands for the total energy. 
What is worse, in the case of the ideal gas the thermodynamic U is all 
kinetic! You just have to learn to live with such conflicting notations. When 
different parts of the subject are invented by different people, this will 
happen: k is used to denote the spring constant, the momentum of the 
photon, and now Boltzmann’s constant! The best we can hope for is that 
two different definitions of the same symbol do not appear in the same 
discussion. 

I’m ready to write down what’s called the first law of thermodynamics 
that talks about what happens if you make a move in the P-V plane from 
point 1 to point 2. The internal energy will change by dU = U2 - U1. We 
want to ask what causes the internal energy of the gas to change. There 
are two ways you can do it: you can do work on the gas by moving the 
piston, or you can put the gas on a hot plate. If you put it on a hot plate, 
we know it’s going to get hotter. If it gets hotter, the temperature goes up. If 
the temperature goes up, the kinetic and hence internal energy goes up. Of 
course, you can put it on a cold plate and take out some heat. So in general 


$$dU = \Delta Q - \Delta W. \tag{22.21}$$


Here ΔW is positive if work is done by the gas, and it is negative if work 
is done on the gas. As for ΔQ, it is positive (negative) if heat is put in 
(taken out). 

Note that the infinitesimal change in U is denoted by dU while small 
quantities of heat and work are denoted by ΔQ and ΔW. The reason will 
follow later. 

What’s the formula for work done by the gas? If it expands against the 
applied pressure 


$$\Delta W = F\,dx = PA\,dx = P\,dV. \tag{22.22}$$


If the gas is compressed so that dx < 0, then the work done by the gas is 
negative. That leads to this great first law of thermodynamics, as applied to 
the gas: 


$$dU = -P\,dV + \Delta Q \quad \text{(first law)}. \tag{22.23}$$


It expresses the law of conservation of energy. It says the energy of 
the gas changes either because you pushed the piston or the piston pushed 
you; or because you put it on a hot plate or cold plate to add or take out 
heat. We are now treating putting it on a hot plate or cold plate as 
equivalent to giving or taking out energy, because we recognize heat as 
energy. 

If you fix the piston so it cannot move, and you put it on a hot plate, 
the PdV part will vanish because there is no chance of a dV. On a hot 
plate there are fast-moving molecules. When they collide with the slow- 
moving gas molecules, typically the slow ones become a little faster and 
the fast ones a little slower, and therefore there will be a transfer of ki- 
netic energy to the gas. Another thing you can do is thermally isolate your 
gas so no heat can flow in or out of it, and then you can either have the 
volume increase or decrease. If the gas expands, dV is positive and 
-PdV is negative, and so is dU. That’s because the molecules are beating 
up on the piston and moving the piston. Remember, applying a force 
doesn’t cost you anything. But if the point of application moves, you do 
work. And who’s going to pay for it but the gas? It’ll pay for it through its 
loss of internal energy. Conversely, if you push down on the gas, dV will 
be negative and dU = -PdV will be positive, and the energy of the gas 
will go up. 
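
The sign conventions are easy to get backward, so here is a small bookkeeping sketch in Python. The function name and the joule values are hypothetical, chosen only to mirror the three cases just described: a clamped piston on a hot plate, an isolated expansion, and an isolated compression.

```python
def change_in_internal_energy(heat_in, work_by_gas):
    """First law: dU = (heat put in) - (work done by the gas).

    heat_in     > 0 when heat flows into the gas, < 0 when it is taken out.
    work_by_gas > 0 when the gas expands, < 0 when it is compressed.
    """
    return heat_in - work_by_gas


# Illustrative numbers in joules (assumptions, not data from the text).
clamped_on_hot_plate = change_in_internal_energy(heat_in=50.0, work_by_gas=0.0)
isolated_expansion = change_in_internal_energy(heat_in=0.0, work_by_gas=30.0)
isolated_compression = change_in_internal_energy(heat_in=0.0, work_by_gas=-30.0)

print(clamped_on_hot_plate)   # 50.0: all the heat goes into U
print(isolated_expansion)     # -30.0: the gas pays for the work it does
print(isolated_compression)   # 30.0: work done on the gas raises U
```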

Let us now calculate the work done in a process where a gas goes 
from a point V to a nearby point V + dV when the pressure is at P(V), as 
in Figure 22.4. The infinitesimal work done, dW = P(V)dV, is the shaded 
area, and the work done over a macroscopic path is 


$$W_{1 \to 2} = \int_{V_1}^{V_2} P(V)\,dV. \tag{22.24}$$


Figure 22.4 During an infinitesimal change by dV, the work done by the gas is 
the shaded area dW = PdV. If this were an isothermal process, some heat ΔQ 
would flow in from the reservoir to keep T and U constant. For a finite process 
the work done is the area between the V axis, the graph P(V), and the vertical 
lines at V = V1 and V = V2. 


The work is the area enclosed by the function P(V), the V-axis, and 
the vertical lines V = V1 and V = V2. To get an analytical expression for 
the work you need to know P as a function of V. We are going to consider a 
special case of an isotherm, a graph of a gas at a given temperature. Because 
PV = nRT, if T is constant, PV is a constant. The graph is a rectangular 
hyperbola: the product of P and V is constant, so when P increases, V 
decreases, and vice versa. 

We want to take the gas for a slow quasi-static ride from 1 to 2 along 
the curve $P(V) = \frac{NkT}{V}$. The work done by the gas is 


$$W_{1 \to 2} = \int_{V_1}^{V_2} P(V)\,dV = NkT \int_{V_1}^{V_2} \frac{dV}{V} = NkT \ln\frac{V_2}{V_1}. \tag{22.25}$$


If I make it go backward from 2 to 1, the work done by the gas is 
given by the same area, but with a minus sign. If you go to the right, the 
area is considered positive. If you go to the left, the area is considered 
negative. The integral Eqn. 22.25 of course gives the right answer for both 
cases because the ln will change sign under V1 ↔ V2. 

If that is $W_{1 \to 2}$, the work done by the gas, what is the heat input? 
Recall the first law 


$$dU = \Delta Q - P\,dV. \tag{22.26}$$


This gas did not change its temperature and $U = \frac{3}{2}NkT$ implies U 
didn’t change. So 


$$\Delta Q = P\,dV \tag{22.27}$$


every step of the way and the gas has had a heat input of 


$$Q = NkT \ln\frac{V_2}{V_1}. \tag{22.28}$$


What do you think is happening to the gas here? Think of the piston 
and weight combination as the gas expands a little bit isothermally. The 
expansion would cool the gas if it were isolated because expansion costs 
energy. However, the gas is held at a fixed T by placing it on some reservoir, 
like a hot plate. Thus when it tries to cool down, heat ΔQ flows from the 
reservoir, maintaining the temperature, as indicated by the vertical arrow 
in Figure 22.4. So, what the gas is doing in this case is taking heat energy 
from below (assuming that is where the reservoir is) and working against 
the atmosphere and weights above. It takes in with one hand and gives out 
to the other, converting heat into work, without changing its own energy. 
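
If you want to convince yourself that the area under the isotherm really is NkT ln(V2/V1), you can add up the thin strips P(V)dV on a computer. The following Python sketch does that with a simple midpoint sum; the choice of one mole at 300 K doubling its volume, the step count, and the function names are all assumptions made for illustration.

```python
import math

k_B = 1.380649e-23     # Boltzmann constant in J/K
N_A = 6.02214076e23    # Avogadro's number, so N = N_A is one mole


def isothermal_work_numeric(N, T, V1, V2, steps=100_000):
    """Add up strips P(V) dV along the isotherm P = N k T / V (midpoint rule)."""
    dV = (V2 - V1) / steps
    total = 0.0
    for i in range(steps):
        V_mid = V1 + (i + 0.5) * dV
        total += (N * k_B * T / V_mid) * dV
    return total


def isothermal_work_exact(N, T, V1, V2):
    """Closed form of Eqn. 22.25: W = N k T ln(V2 / V1)."""
    return N * k_B * T * math.log(V2 / V1)


T, V1, V2 = 300.0, 0.010, 0.020   # illustrative values in K and m^3
W_forward = isothermal_work_numeric(N_A, T, V1, V2)
W_exact = isothermal_work_exact(N_A, T, V1, V2)
W_backward = isothermal_work_numeric(N_A, T, V2, V1)

print(W_forward, W_exact)   # both about 1729 J
print(W_backward)           # same size, opposite sign

# U does not change on an isotherm, so the heat input equals the work done.
Q_forward = W_exact
```

Going from 2 back to 1 flips the sign of every dV, which is why the backward work comes out as minus the forward work.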


22.8 Specific heats: c_v and c_p 


You have to be careful when you talk about specific heats of gases, and here 
is why. For liquids and solids, given dQ = mc dT, we define $c = \frac{dQ}{dT}$ per 
unit mass of the substance. For a gas, you’ve already seen that what you 
want to count is not the actual mass, but the moles. Every molecule gets 
a certain amount of energy, namely $\frac{3}{2}kT$, and so you just want to count 
the number of molecules, or the number of moles. Now, there are many 
ways in which you can pump heat into a gas, but let’s agree that we will 
always take one mole from now on and not one kilogram. Take a mole of 
some gas and define the molar specific heat as the energy needed to raise the 
temperature of one mole by one degree. If you take one mole, it has energy 
$U = \frac{3}{2}N_A kT = \frac{3}{2}RT$. From the first law 


$$\Delta Q = dU + P\,dV \tag{22.29}$$


$$\frac{\Delta Q}{dT} = \frac{dU}{dT} + P\frac{dV}{dT}. \tag{22.30}$$


There’s a problem here. Did you or did you not allow the volume 
to change while adding heat? That’s going to determine what the specific 
heat is. In other words, when a solid is heated, it expands such a tiny 
amount that we don’t worry about the work done by the expanding solid 
against the atmosphere. But, when you heat a gas, the volume changes so 
much that the work it does against the external world is non-negligible. 
Therefore, the specific heat is dependent on what you allow the volume 
term to do. 

Consider c_v, the specific heat at constant volume where you don’t let 
the volume change; you clamp the piston. You pump in heat by putting it 
on a hot plate. All the heat goes directly to internal energy. None of that is 
lost to expansion. Setting dV = 0 in Eqn. 22.30 and n = 1 in $U = \frac{3}{2}nRT$, 


$$c_v = \left.\frac{dQ}{dT}\right|_V = \frac{dU}{dT} = \frac{3}{2}R. \tag{22.31}$$


Then, there’s c_p, the specific heat at constant pressure defined as 
follows. You have some gas at some pressure. You pump in some heat but you 
don’t clamp the piston. You let the gas expand freely at the same pressure. 
For example, if it’s being pushed down by the atmosphere, you let the pis- 
ton move up if it wants to, maintaining the same pressure. If it moves up 
a little bit, some of the heat that you put in goes to changing its internal 
energy and some into doing the work PdV. So now 


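$$c_p = \left.\frac{dQ}{dT}\right|_P = \frac{dU}{dT} + P\left.\frac{dV}{dT}\right|_P = \frac{3}{2}R + \frac{d(PV)}{dT} = \frac{3}{2}R + \frac{d(RT)}{dT}$$


$$= \frac{3}{2}R + R = \frac{5}{2}R = c_v + R \tag{22.32}$$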


where we have taken P, which is being held fixed, into the T derivative and 
used PV = RT for one mole. 

You should have expected c_p > c_v because some of the heat goes into 
expanding the gas and only the rest into raising its T. This is confirmed 
and quantified by the result 


$$c_p = c_v + R. \tag{22.33}$$


Finally, consider the ratio that will come in later: 


$$\gamma = \frac{c_p}{c_v} = \frac{5}{3}. \tag{22.34}$$


Notice that neither c_p nor c_v depends on what particular mono-atomic 
gas is involved. All will have the same specific heat per mole or 
molar specific heat. They won’t have the same specific heat per gram, 
because one gram of two different gases will have a different number of moles 
or atoms. 
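
Here is a short Python sketch of that last point. The molar specific heats are the same for every mono-atomic ideal gas, but dividing by the molar mass gives very different numbers per gram. Helium and argon are my own choice of examples, with molar masses of about 4.0026 and 39.948 grams per mole, and R is rounded to 8.314 J/(mol K).

```python
R = 8.314                     # gas constant in J/(mol K), rounded

# Molar specific heats of any mono-atomic ideal gas (Eqns. 22.31 to 22.34).
c_v = 1.5 * R
c_p = c_v + R
gamma = c_p / c_v

# Approximate molar masses in g/mol (examples chosen for illustration).
molar_mass = {"helium": 4.0026, "argon": 39.948}

# Specific heat at constant volume per gram, in J/(g K).
c_v_per_gram = {gas: c_v / mass for gas, mass in molar_mass.items()}

print(c_v, c_p, gamma)        # about 12.47, 20.79, and 1.667
for gas, value in c_v_per_gram.items():
    print(gas, round(value, 3))
```

A gram of helium contains roughly ten times as many atoms as a gram of argon, so it takes roughly ten times as much heat to warm it by one degree.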

One final caveat: all this is for a mono-atomic gas, whose constituents 
are essentially points, with just the kinetic energy of translation. Compare 
this to, say, a diatomic gas, whose molecules are two atoms joined together 
like a dumbbell. The kinetic energy of the dumbbell has two parts, as you 
learned long ago. It can rotate around some axis and its CM can also move 
in space. So, the internal energy has also got two parts: due to the motion 
of the center of mass and due to rotation. Some molecules also vibrate. The 
value of γ is not 5/3 for these. 


CHAPTER 23 


Thermodynamics II 


23.1 Cycles and state variables 
Let us begin with the first law 


$$dU = \Delta Q - \Delta W = \Delta Q - P\,dV, \tag{23.1}$$


which says that dU, the change in the energy of the gas, equals the heat 
input ΔQ minus the work done by the system, ΔW = PdV. Why do we 
refer to some infinitesimals with a Δ and some with a d? 

This has to do with whether these refer to simply small quantities or 
to small quantities that correspond to a change in a state variable, which 
is some function that depends on the state of the gas, specified by P and 
V. Consider the internal energy U. It is a state variable. That means that 
at each point in the P-V plane it has a definite value. If we take the gas 
for a spin and come back to where we started, U will return to its orig- 
inal value. It will not matter what path we took: if you are back where 
you started, U is back to its value. Thus in Figure 23.1, if we start at some 
(P, V) and go around the shaded area in the clockwise sense and return, 
so does U to U(P, V). For example, for an ideal gas, $U = \frac{3}{2}PV$, and dU is 
the change in this function due to a slight change in state. At any point 
(P, V) we may speak of the internal energy resident in the gas. If you 
peek in and add the kinetic energies of all the molecules, you will get this 
number. 

This is not so for the work W. We cannot speak of the work in the 
gas; it does not correspond to anything resident in the system. Consider 
what happens when we go around the closed loop shown in Figure 23.1. 
What is the work done? We have seen in the last chapter that, for an open 
segment, it is the area under the graph if we move in the direction of 
increasing V and minus that if we move to the left. For the closed region 
shown, it must be clear that as we move along the upper part, from A to 
B in the direction shown, the work done is the area under that part of the 
curve. Then as we move back along the lower part, from B to A, we must 
subtract the area under that segment. The algebraic sum of these is the area 
of the shaded region. This is written as follows 


$$W_{\text{cycle}} = \oint P(V)\,dV. \tag{23.2}$$


Figure 23.1 The work done by a gas as it goes over a closed cycle equals the area 
enclosed by the cycle in the P-V plane. During AB the gas expands and does 
work, and during BA it is compressed and has work done on it. The shaded area 
is the algebraic sum of the two. 


If there were a definite value of W attached to each state, there would 
be a problem: after adding W_cycle to the system, you have come back to the 
same state! So there isn’t a definite value of W attached to each state; W 
is not a function W(P, V), unlike U, which is. On the other hand, we can 
always say some tiny amount of work ΔW was done by the system; it is 
just a number. 

Just like W, there is no function Q(P, V) associated with each state. 
You cannot peek into the gas and say, “I see this amount of heat there.” Let 
me prove it. Say there is some Q that you claim is resident in the system 
when it is at (P, V). I then send you away and take the gas through the 
cycle shown. Because U(P, V) is back to its old value, and work W_cycle was 
done by the system, the Q in the system must have increased by W_cycle. 
Now you come back, and I ask you how much heat there is in the system. 
You will give the old answer because the system is in exactly the same state, 
but I know Q has increased by the area of the closed loop. So Q is not a 
state variable, though we can always say some number of calories ΔQ were 
added to the system. 

In summary, only if F(P, V) is a state variable will we use dF to 
represent the change in that function as we move to a neighboring point. 
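
A concrete cycle makes the distinction vivid. The Python sketch below takes a mono-atomic ideal gas clockwise around a rectangle in the P-V plane, made of two constant-pressure legs and two constant-volume legs. The rectangle and its corner values are assumptions I picked for easy arithmetic; the shape is not the loop drawn in Figure 23.1.

```python
def internal_energy(P, V):
    """State function U = (3/2) P V for a mono-atomic ideal gas."""
    return 1.5 * P * V


# Corners (P in Pa, V in m^3) of a rectangular cycle, traversed clockwise.
corners = [
    (2.0e5, 1.0e-3),   # start: high pressure, small volume
    (2.0e5, 3.0e-3),   # after expanding at constant high pressure
    (1.0e5, 3.0e-3),   # after the pressure drops at constant volume
    (1.0e5, 1.0e-3),   # after compression at constant low pressure
]

W_cycle = 0.0
Q_cycle = 0.0
dU_cycle = 0.0
for i, (P_a, V_a) in enumerate(corners):
    P_b, V_b = corners[(i + 1) % len(corners)]
    # On a constant-pressure leg the work is P * (change in V);
    # on a constant-volume leg V_b == V_a, so the same formula gives zero.
    work = P_a * (V_b - V_a)
    dU = internal_energy(P_b, V_b) - internal_energy(P_a, V_a)
    heat = dU + work            # first law, rearranged
    W_cycle += work
    Q_cycle += heat
    dU_cycle += dU

enclosed_area = (2.0e5 - 1.0e5) * (3.0e-3 - 1.0e-3)
print(dU_cycle)                  # essentially 0: U is back to its old value
print(W_cycle, enclosed_area)    # both about 200 J
print(Q_cycle)                   # about 200 J of net heat went in
```

After one lap U has returned to where it started, while the running totals of W and Q have each grown by the enclosed area. That is why neither can be a function of the state.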


23.2 Adiabatic processes 


We have already considered the work done under an isothermal process: 
$W_{1 \to 2} = NkT \ln\frac{V_2}{V_1}$. In the isothermal case, you keep the gas on a hot plate 
at a given temperature T, and as the volume changes, heat comes in or 
goes out to maintain T. A process at constant pressure P is called isobaric. 
The work done is trivial: $W_{1 \to 2} = P(V_2 - V_1)$, which is just the area of a 
rectangle of height P and width V2 - V1. 

Now I’m going to consider the adiabatic process in which the gas is 
thermally isolated, so ΔQ = 0. You wrap this guy in a blanket and you 
do things to it. Consider point A in Figure 23.2, which under isothermal 
expansion ends up at B, on the same isotherm T = T1. Likewise, 
from A′ the system can expand isothermally at T = T2 to C. Which 
is higher, T1 or T2? Take the point A′ on T2, directly below A. It has 
the same volume as A but lower pressure, so PV = NkT is less. So 
T2 < T1. 

But suppose I start at A and let the gas expand adiabatically. The gas 
expands against the external pressure but no heat is allowed to come in. 
It’ll pay for the work through its own internal energy, which will go down, 
and that means T will go down. The gas will be cascading down from 
one isotherm to another, until you stop somewhere at the lower 
temperature, such as the point D at T2. Another way to say this is that the drop 
in pressure for a given increase in volume would be more precipitous for 
the adiabatic process. The evolution will not be P ∝ 1/V, but something 
steeper. What is the equation for an adiabatic process? What is P as a 
function of V? 

Figure 23.2 The figure shows two isothermal curves at T1 and T2 and two 
adiabatic curves crossing them. In the latter case P falls more rapidly with 
increasing V because it gets no help from outside. 


This will be determined by the first law of thermodynamics. First we 
combine the definition of adiabatic, ΔQ = 0, with the first law to obtain 


$$\Delta Q = dU + P\,dV = 0. \tag{23.3}$$


This equation relates the change in U to the change in V in an 
adiabatic process. Given $U = \frac{3}{2}nRT$, it follows that $dU = \frac{3}{2}nR\,dT = nc_v\,dT$. 
Using PV = nRT we find 


$$nc_v\,dT + \frac{nRT}{V}\,dV = 0. \tag{23.4}$$


Canceling the n, dividing by RT, and rearranging 


$$\frac{c_v}{R}\frac{dT}{T} + \frac{dV}{V} = 0. \tag{23.5}$$


Note that because n drops out, we could have analyzed just one mole of 
the gas from the outset. 

This equation not only tells us what we expect intuitively, that 
if the volume increases (dV > 0), the temperature decreases (dT < 0), but also 
quantifies dT, the drop in temperature under adiabatic expansion (or its 
opposite under adiabatic compression). However, what we wanted to do 
was relate dV to dP, to see how the adiabatic evolution takes place in the 
P-V plane. Let us postpone that and first get a result connecting V and T, 
because it will be very useful later. We can then use PV = RT to eliminate 
T from that result. 

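Integrating Eqn. 23.5 between states 1 and 2 yields 


$$\frac{c_v}{R}\ln\frac{T_2}{T_1} + \ln\frac{V_2}{V_1} = 0. \tag{23.6}$$


We may rewrite this as follows: 


$$\ln\left[\left(\frac{T_2}{T_1}\right)^{c_v/R} \cdot \frac{V_2}{V_1}\right] = 0 \tag{23.7}$$

$$\left(\frac{T_2}{T_1}\right)^{c_v/R} \cdot \frac{V_2}{V_1} = 1 \quad \text{using } \ln 1 = 0 \tag{23.8}$$

$$T_1^{c_v/R} V_1 = T_2^{c_v/R} V_2 \tag{23.9}$$


which may be rewritten as 


$$T^{c_v/R} V = C, \tag{23.10}$$


where C is a constant along the adiabatic path connecting state 1 to state 2. 
For an ideal mono-atomic gas, c_v/R would be just 3/2 but let us keep it in 
this form for now. 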

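To find pressure as the function of volume on an adiabatic curve, we 
eliminate T in 


$$T^{c_v/R} V = C \tag{23.11}$$

$$\left(\frac{PV}{R}\right)^{c_v/R} V = C \tag{23.12}$$

$$P^{c_v/R} V^{1 + c_v/R} = C' \quad \text{another constant} \tag{23.13}$$

$$P V^{\frac{c_v + R}{c_v}} = C'' \quad \text{on raising both sides above} \tag{23.14}$$


to the R/c_v power. 

Finally, we have the familiar version 


$$P_1 V_1^{\gamma} = P_2 V_2^{\gamma} = P V^{\gamma} = C \quad \text{where} \tag{23.15}$$

$$\gamma = \frac{c_v + R}{c_v} = \frac{c_p}{c_v}. \tag{23.16}$$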

Here P and V refer to a generic point on the adiabatic curve, and we have 
renamed the final constant C'' in Eqn. 23.14 as C. It’s not a constant you 
can look up in a book like the velocity of light. It depends on this particular 
sample of gas. (When we say the energy is constant for a particle, it is not 
a universal constant like Boltzmann’s constant k; instead, it is constant for 
this particle on this particular trajectory, say, as it gains height and loses 
speed. Here, as the gas loses P and gains V in an adiabatic process, PV^γ 
remains fixed at a value C that depends on the particular sample.) 
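
You can also watch the adiabatic law emerge numerically. The Python sketch below steps the first-law relation c_v dT + (RT/V) dV = 0 for one mole of a mono-atomic gas through many small volume changes, then compares the result with the closed forms derived above. The starting temperature, the volumes, the step count, and the function name are illustrative assumptions, and the stepping is a plain Euler scheme, so agreement is only approximate.

```python
R = 8.314                   # gas constant in J/(mol K), rounded
c_v = 1.5 * R               # mono-atomic ideal gas
gamma = (c_v + R) / c_v     # Eqn. 23.16


def adiabatic_final_temperature(T1, V1, V2, steps=100_000):
    """Step c_v dT + (R T / V) dV = 0 from V1 to V2 in small increments."""
    T, V = T1, V1
    dV = (V2 - V1) / steps
    for _ in range(steps):
        T -= (R / c_v) * T * dV / V   # dT = -(R / c_v) T dV / V
        V += dV
    return T


T1, V1, V2 = 300.0, 1.0e-3, 2.0e-3        # one mole doubling its volume
T2_stepped = adiabatic_final_temperature(T1, V1, V2)
T2_formula = T1 * (V1 / V2) ** (R / c_v)  # from T^(c_v/R) V = constant

# Pressures from PV = RT for one mole, then the ratio of P V^gamma values.
P1 = R * T1 / V1
P2 = R * T2_stepped / V2
pv_gamma_ratio = (P2 * V2 ** gamma) / (P1 * V1 ** gamma)

print(T2_stepped, T2_formula)   # both about 189 K
print(pv_gamma_ratio)           # very close to 1
```

The temperature falls from 300 K to roughly 189 K with no heat exchanged at all, and P V^γ is the same at both ends to within the stepping error.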

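Now we calculate the work done in an adiabatic process, say from B 
to C in Figure 23.2. 


$$W_{B \to C} = \int_{V_B}^{V_C} P(V)\,dV \tag{23.17}$$

$$= \int_{V_B}^{V_C} C V^{-\gamma}\,dV \tag{23.18}$$

$$= C\left[\frac{V_C^{1-\gamma}}{1-\gamma} - \frac{V_B^{1-\gamma}}{1-\gamma}\right] \tag{23.19}$$

$$= P_C V_C^{\gamma}\,\frac{V_C^{1-\gamma}}{1-\gamma} - P_B V_B^{\gamma}\,\frac{V_B^{1-\gamma}}{1-\gamma} \tag{23.20}$$

$$= \frac{P_B V_B - P_C V_C}{\gamma - 1}, \tag{23.21}$$


where we have used $C = P_C V_C^{\gamma} = P_B V_B^{\gamma}$ in the penultimate step. 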
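
As a check on Eqn. 23.21, the Python sketch below adds up P(V)dV along an adiabat and compares the sum with the closed form. It also compares both with the drop in internal energy, (3/2)(P_B V_B - P_C V_C), since in an adiabatic process the gas pays for the work entirely out of U. The starting pressure and the two volumes are assumptions picked for illustration, and γ = 5/3 assumes a mono-atomic gas.

```python
gamma = 5.0 / 3.0           # mono-atomic ideal gas


def adiabatic_work_numeric(P_B, V_B, V_C, steps=100_000):
    """Midpoint sum of P(V) dV along the adiabat P V^gamma = C."""
    C = P_B * V_B ** gamma
    dV = (V_C - V_B) / steps
    total = 0.0
    for i in range(steps):
        V_mid = V_B + (i + 0.5) * dV
        total += C * V_mid ** (-gamma) * dV
    return total


def adiabatic_work_exact(P_B, V_B, P_C, V_C):
    """Closed form of Eqn. 23.21."""
    return (P_B * V_B - P_C * V_C) / (gamma - 1.0)


P_B, V_B, V_C = 1.0e5, 1.0e-3, 2.0e-3     # illustrative values in Pa and m^3
P_C = P_B * (V_B / V_C) ** gamma          # pressure at C on the same adiabat

W_numeric = adiabatic_work_numeric(P_B, V_B, V_C)
W_exact = adiabatic_work_exact(P_B, V_B, P_C, V_C)
drop_in_U = 1.5 * (P_B * V_B - P_C * V_C)  # U = (3/2) P V at B minus at C

print(W_numeric, W_exact, drop_in_U)       # all about 55.5 J
```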


23.3 The second law of thermodynamics 


We come to the last part of thermodynamics, which to me is the most 
beautiful. It starts out with the following consideration. There are certain 
things in this world that seem perfectly allowed but don’t seem to 
happen. Take the Joule experiment. You take water in a cylinder with a shaft 
with fins that can spin. You hang some weight over a pulley, and, as the 
weight goes down, the shaft spins; the water heats up. You film a movie of 
that, and the movie is so detailed you can even see individual molecules. 
Now you play the movie backward. What do you find? Suddenly, the 
weight starts moving up, the shaft spins the other way, and the water cools 
down. This does not violate any of the laws you have learned, including the 
first law of thermodynamics. When the weight went down and the water 
heated up, some amount of work was done on the water, and the energy 
of the water went up. In the reverse process, the energy of the water went 
down and the weight went up, and an equal amount of work was done by 
the water. But that reverse process doesn’t seem to happen, even though 
microscopically there is nothing funny about it. If you looked at the colli- 
sions between the water molecules and the fins, you would not know at the 
microscopic level whether the movie was going forward or backward; every 
collision would obey Newton’s laws, and energy and momentum would be 
conserved. Why is the reverse process, allowed by all the laws of mechanics, 
somehow forbidden from happening in real life? 

Here is another example. Consider an eraser on a table. I give it a push 
and it moves a bit and stops. The table and eraser heat up. Take a movie 
of that. Play it backward. You will find the table and eraser cool down, and 
the eraser moves backward, picking up speed. You might laugh, but you 
have no reason to do so right now, because the movie doesn’t contradict 
any physics you have learned—just your expectations based on your daily 
experience. In the reverse process, every atom is made to stop, turn around, 
and move in the reverse direction at the reversed velocity. And every colli- 
sion between atoms on the desk and atoms on the eraser will obey all the 
laws of mechanics in the forward and backward movies. 

In the cases we have considered, the common feature seems to be that 
kinetic energy can turn into heat easily, but not so the reverse. 

Now take a different kind of example. I put some gas molecules in the 
left half of a box, with a partition holding them in that side. Then I remove 
this partition and I wait a little bit. The gas will fill up the box. That’s like 
perfume leaking out of the bottle. Now I take a movie of these molecules 
and play the movie backward. In the backward movie the gas, which had 
spread over the box, will spontaneously gather in the left half. The movie 
does not violate any of the laws of mechanics. But what it portrays does 
not happen. 

I take a chunk of some hot copper and a chunk of some cold copper. 
I place one on top of the other, thermally isolate them from the outside 
world, come back in an hour, and they have both become lukewarm. That’s 
fine. But now maybe, if I wait long enough, the lukewarm chunks will 
spontaneously separate into hot and cold. That doesn’t seem to happen. 
The heat seems to flow spontaneously from hot to cold but never back 
from cold to hot. But flowing from cold to hot would not violate the 
law of conservation of energy. As long as the same number of calories go 
from cold to hot, it doesn’t violate anything. And yet that doesn’t seem to 
happen. 

This list of things allowed by all known laws and yet forbidden in 
nature can go on and on. Because we cannot explain this interdiction with 
any known laws of physics, we elevate it to a new law. The new law could 
say all of these things are forbidden. That’s not a good enough law. The list 
of forbidden processes is a mile long. 

Amazingly, there is one law, a single law, that not only is qualitative, 
but is quantitative, and it tells you exactly which things can happen and 
which cannot. That is the second law of thermodynamics. What form does 
it take? We will introduce a certain quantity called entropy. The second 
law will say that the entropy of the universe will never decrease. If the pad- 
dle spun the other way, the weight went up, and the water cooled down, 
you can show the entropy of the universe would have actually gone down. 
That’s why that’s not allowed. If you drop an egg and it splatters all over the 
floor, it cannot rejoin and rise back to your hand, for this too would cause 
a decrease of entropy of the universe. So will the unmixing of lukewarm 
into hot and cold and all the other forbidden things. 

This great law was discovered following the investigations of an 
engineer called Sadi Carnot. One does not generally wake up and say, 
“I’m going to discover a great law.” You just go about your business, but 
you need to recognize it when you’ve stumbled on something big, as 
Archimedes did in his bathtub. Carnot had a very practical question about 
engines, like the steam engine. You take some coal, you burn it to boil some 
water, it turns into steam, which in turn pushes the pistons that turn the 
wheels that make the train go forward. What happens in a steam engine 
is described schematically in the left half of Figure 23.3. Heat Q1 from a 
reservoir at T1 goes into the engine. Out comes some mechanical work W 
and some exhaust emission Q2 at a lower temperature T2. In the case of the 
steam engine T1 is the temperature of the furnace, Q1 the heat generated 
by burning the coal; T2 is generally room temperature. If you ever have 
the opportunity to see a steam engine, you will notice that there’s a lot of 
hot steam coming out of the side. That is Q2. Note that Q2 is defined to be 
positive coming out of the engine. 

Figure 23.3 A heat engine (left) takes in heat Q1 from a reservoir at T1, does 
some work W, and rejects heat Q2 to a colder reservoir at T2. Things go the other 
way in a refrigerator (right): work W is pumped in to transfer heat Q2 from T2 to 
a higher temperature T1. 


The engine operates in a cycle, that is, it can do all of this (absorb Q1, 
do work W, and reject Q2) over and over again. 

You can fill in the corresponding items for a gasoline engine. 

On the right half of the figure is a device called the refrigerator: it 
takes in Q2 from a cold place (freezer), has some work input W (from the 
electrical socket that powers the compressor), and emits Q1 = Q2 + W into 
the environment (your kitchen). The refrigerator will play an important 
part later, but let us return to the engine for now. 

By the first law, the work done by the engine (which manifests itself 
as the kinetic energy of the locomotive or automobile) is 


$$W = Q_1 - Q_2. \tag{23.22}$$


We define a quantity called efficiency η. It’s the work you want (W), 
divided by the heat you pay for (Q1): 


$$\eta = \frac{W}{Q_1} = \frac{Q_1 - Q_2}{Q_1} = 1 - \frac{Q_2}{Q_1}. \tag{23.23}$$


Every engine takes in heat. Some of it is converted to work and some 
of it is rejected. To the extent heat is rejected, the efficiency is less than 1. 
Why not build an engine that doesn’t reject any heat at the lower 
temperature T2? Why not take all the heat and convert it to work? Carnot gave a 
great argument for the most efficient engine you can build operating 
between these two temperatures T1 (at which heat is produced) and T2 (at 
which heat is rejected). He showed that there is an upper limit to η and the 
limit is less than 1. 

What is this limiting η and how did he find it? He needed a postulate. 
Carnot’s postulate is the old version of the second law of thermodynamics, 
and it is fully equivalent to the modern version that makes reference to 
entropy. Here is Carnot’s law: You cannot build an engine whose sole effect 
is to transfer some heat from a cold body to a hot body. 

But we already have noticed that heat cannot flow from cold to hot 
spontaneously; that is why two lukewarm blocks of copper cannot separate 
into hot and cold blocks. All Carnot seems to say is that this cannot hap- 
pen, because he postulates it cannot. In fact, from this one postulate, one 
may develop the notion of entropy, which in turn will be used to outlaw 
all the forbidden processes with a single restriction on entropy. 

It doesn’t take much effort to have heat flow from hot bodies to cold 
bodies: just connect them with a metal rod and wait. The sole effect of that 
is the transfer of heat from hot to cold. That’s the natural order of things. 
Carnot is saying that you can never build a contraption whose sole effect 
is the transfer of heat from a cold body to a hot one. That is going to be 
taken as a reasonable postulate, and we want to see what we can get out 
of that postulate. It turns out that this postulate is going to put a bound 
on the efficiency of heat engines, by demanding that you must necessarily 
reject some heat Q2. From this postulate evolved the second law, assuming 
more and more sophisticated forms, and culminating in the language of 
entropy. 


23.4 The Carnot engine 


The engine that Carnot conceived is depicted in Figure 23.4. Take an 
isotherm at T1 with points A and B on it. Draw an isotherm at T2 < T1. 
You remember the adiabatic curves are much steeper. So draw two 
adiabatic curves leaving A and B at T1 to join D and C on the colder isotherm 
at T2. Then, take an ideal gas from A to B to C to D and back to A. That 
is the Carnot cycle. In the process AB, take the cylinder with the gas in it, 
sitting on top of a reservoir at T1, and slowly lift the grains of sand off the 
piston so it expands to the volume at B at the same temperature. Then, having 
reached B, you thermally isolate the gas (shown by a box surrounding it 
on the figures at the right). Take out even more grains of sand. Now, it’s 
expanding without any energy coming in. It cools down to point C at T2 
along the adiabatic curve BC. Now you put it on a cold reservoir at T2 and 
slowly start putting the sand grains back until you get to D. Finally, you 
isolate the gas and put more grains of sand back until you come back to A. 
Obviously, such a path exists because of the way isothermal and adiabatic 
lines crisscross the P-V plane. 

Figure 23.4 The Carnot cycle, running as an engine. It can also be reversed to 
run as a refrigerator. The process A → B is isothermal expansion at T1 during 
which heat Q1 is drawn in from the hot reservoir. The part B → C is adiabatic 
expansion, and the system cools down to T2. (The box surrounding the system 
signifies thermal isolation.) The part C → D is isothermal compression at T2 
when Q2 is rejected to the cold reservoir. Finally the adiabatic compression 
D → A closes the cycle. 

The important feature of the Carnot cycle is that it’s reversible. It’s 
reversible because at every stage, you are almost in equilibrium. If you 
take out a grain of sand and the piston moves up, you can put it back and 
move the piston back. If it absorbs some heat going one way, it will reject it 
going the other way. So, it can also run backward as a refrigerator (shown 
in Figure 23.4) following A → D → C → B → A. Some heat Q2 is taken 
from the lower reservoir and the heat Q1 is delivered to the upper reservoir 
because some work W is done on the refrigerator. You might say, “You 
violated the second law. You have transferred heat from a cold to a hot body!” 
But it was not the only thing that happened, because some compressor 
somewhere did some work. Some water in a dam flowed downhill to gen- 
erate that electrical power. You paid your electric bill. Things are not how 
they used to be. What is not allowed is to have heat flow from cold to hot 
with nothing else changed. 

Notice that after doing its cycle, the gas comes back to the starting 
point A. That means you can do it over and over again, and Q1, Q2, and W 
refer to one cycle. That’s why it’s a useful engine. (You can always make a 
disposable, one-shot engine that converts heat fully to work: take a hot gas 
in a cylinder, and release the piston so it can rise up and lift some weights 
and cool down in the process. You have converted some heat fully to work, 
but things are not back where they were. We are only interested in engines 
that operate in cycles.) Carnot’s question was, “What’s the efficiency of this 
engine?” You might wonder why someone would be interested in the η of 
such a primitive engine. Before I explain that, let us first calculate η. 


To find η = 1 − Q2/Q1 we need Q1 and Q2. Because AB is isothermal, the 
heat input Q1 is the work done W_{A→B}: 

$$ Q_1 = W_{A \to B} = nRT_1 \ln\frac{V_B}{V_A}. \tag{23.24} $$

Likewise 

$$ Q_2 = |W_{C \to D}| = nRT_2 \ln\frac{V_C}{V_D}, \tag{23.25} $$

where in Q2 the ratio of initial to final volumes V_C/V_D appears rather than 
V_D/V_C because Q2 has been defined as the heat rejected. So 

$$ \eta = 1 - \frac{Q_2}{Q_1} = 1 - \frac{T_2 \ln(V_C/V_D)}{T_1 \ln(V_B/V_A)}. \tag{23.26} $$

First notice n drops out as it should, because the efficiency should not depend 
on how much gas we take. If we double the gas, we simply double 
Q1, Q2, and W. It turns out we can even cancel the logarithms using the 
following result, which I will prove in a moment: 


$$ \frac{V_B}{V_A} = \frac{V_C}{V_D}. \tag{23.27} $$

The final result is simplicity itself: 

$$ \eta = 1 - \frac{T_2}{T_1}. \tag{23.28} $$


In what follows, I will always use η to denote this maximum efficiency. 

Now for Eqn. 23.27. We have seen in Eqn. 23.11 that in an adiabatic 
process VT^{3/2} is a constant, that is, for two points x and y on the adiabatic 
curve, 

$$ V_x T_x^{3/2} = V_y T_y^{3/2}. \tag{23.29} $$

Apply this to the pair x = B, y = C and x = A, y = D. Because A and B are 
at T1 and C and D are at T2, we find 

$$ V_B T_1^{3/2} = V_C T_2^{3/2} \tag{23.30} $$
$$ V_A T_1^{3/2} = V_D T_2^{3/2}. \tag{23.31} $$


Dividing the left side by the left and the right side by the right we find 
Eqn. 23.27. 
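As a numerical sanity check on Eqns. 23.24 through 23.28, here is a minimal Python sketch. The temperatures, volumes, and the function name carnot_cycle are illustrative assumptions, not values from the text, and the gas is taken to be monatomic so that VT^{3/2} is constant on the adiabats.

```python
import math

R = 8.314  # gas constant in J/(mol K)


def carnot_cycle(n, T1, T2, VA, VB):
    """Return (Q1, Q2, W, eta, VC, VD) for a Carnot cycle of a monatomic ideal gas."""
    # On an adiabat V * T**1.5 is constant, which fixes VC and VD.
    VC = VB * (T1 / T2) ** 1.5
    VD = VA * (T1 / T2) ** 1.5
    Q1 = n * R * T1 * math.log(VB / VA)  # heat absorbed on isotherm AB
    Q2 = n * R * T2 * math.log(VC / VD)  # heat rejected on isotherm CD
    W = Q1 - Q2                          # net work per cycle
    return Q1, Q2, W, W / Q1, VC, VD


# Hypothetical numbers: 1 mole between 500 K and 300 K, doubling its volume on AB.
Q1, Q2, W, eta, VC, VD = carnot_cycle(n=1.0, T1=500.0, T2=300.0, VA=1.0, VB=2.0)
print(f"eta from heats = {eta:.6f}, 1 - T2/T1 = {1 - 300.0 / 500.0:.6f}")
print(f"VB/VA = {2.0 / 1.0:.6f}, VC/VD = {VC / VD:.6f}")
```

Changing n rescales Q1, Q2, and W together but leaves eta untouched, which is the statement that the efficiency does not depend on how much gas we take.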

I’ve taken an ideal gas over a cycle and found out that it does function 
like a very primitive heat engine because it takes some heat Q1, rejects some 
heat Q2, and does some work W = Q1 − Q2 given by the area enclosed by 
the cycle. The ratio of work done to heat absorbed happens to depend 
only on the upper and the lower temperatures. It does not depend on 
the gas. 

I will show that this efficiency is a theoretical maximum. No engine 
can beat the Carnot engine in efficiency, not even one built in 2019. This 
analysis is reminiscent of relativity. To show you why time slows down 
in a moving frame, I took a very simple clock, where a light beam went 
up and down between two mirrors. That’s not your idea of a clock, but 
you can see why it slows down. But then you know that every clock has to 
slow down the same way, because all clocks in a moving rocket must run 
at the same rate. Otherwise, you can compare the two clocks and find out 
you are moving. Likewise, if you take a very primitive engine, but you can 
show that it is the most efficient engine, you are done finding the upper 
limit on the efficiency of all engines. 

Here is Carnot’s argument, based on the postulate that it’s impossible 
to find a process the sole result of which is to transfer some heat from a 
cold body to a hot body. You have to grant Carnot that postulate, which 
we accept to be phenomenologically valid. Given that postulate, Carnot 
will now show you that no engine can beat his engine. 

The key to the Carnot engine is that it is a reversible engine. That 
means the Carnot engine, starting at the point A, can go backward to D 
then C then to B and then back to A. If the Carnot engine were run backward, 
it would look like a refrigerator: it would take in heat Q2 at T2, 
somebody would do work W on it, and it would reject Q1 at T1. That’s 
what we’re going to use to show that you cannot beat the Carnot engine. 

Here is a concrete demonstration that’s good enough for our present 
purpose. Let us say our illustrative Carnot engine takes 100 calories, delivers 
20 calories in work, and rejects 80 calories operating between some 
two temperatures T1 and T2. I am taking a particular example where the 
efficiency is η = W/Q1 = 20/100 = 0.2. Now, if you say you have a better engine, 
what you really mean is that your engine, also operating between the same 
two temperatures, can take 100 calories and deliver more than 20 calories 
of work, let us say 40 calories, and reject only 60 calories. This is shown at 
the left in the box in Figure 23.5. To disprove your claim, I am going to get 
a Carnot engine that is twice as big as the original one and then I am going 
to run it backward. Mind you, it is not more efficient; it just has twice as 
much gas. What will that Carnot engine, called 2 x Carnot* in the right 
half of the box, do? It will take 160 calories from the colder reservoir, it will 
want 40 calories of work input, and it will dump 200 calories at the upper 
reservoir, as shown in the right half of the box. I want the reverse Carnot 
(refrigerator) to be twice as big for the following simple reason: Your en- 
gine is delivering 40 calories of work per cycle; my refrigerator needs 40 
to run. We can directly take the work output from your engine and feed it 
to my refrigerator. Your heat engine produces work, my refrigerator needs 
work, and I’ve scaled its size so that its appetite matches your engine’s 
output. 




Figure 23.5 (panels: YOUR ENGINE, 2 × CARNOT*) If you have an engine that is more efficient than Carnot’s it can be 
combined with his (reversed and rescaled) to form a machine whose sole effect is 
to transfer heat from a cold to a hot reservoir. In this case the Carnot took in 
100 calories, did 20 calories of work, and rejected 80. The supposed rival, which 
took in 100, did 40 of work, and rejected 60, is shown in the left half of the box. 
The machine called 2 x Carnot* in the right half of the box is the original Carnot 
reversed and doubled in size. The gadgets taken together (and enclosed within 
the box) equal a third one, shown on the far right, which transfers 100 calories 
from cold to hot with no other effects. 


Now, let’s draw a box around these two guys and not look under the 
hood to see what we’ve got. At the end of a full cycle, when everything is 
done, we find that all the gases, all the pistons, have come back to where 
they were at the outset. No need to plug this gadget into the wall because 
the refrigerator at the right is getting the power from this heat engine at 
the left. I look at the lower reservoir; I see 100 calories leaving: 60 coming 
down and 160 going up. I look at the upper reservoir; I see 200 calories 
in and 100 out. So basically, the combined gadgets are equal to a single 
gadget that transfers 100 calories per cycle from the cold reservoir at T2 
to a hotter one at T1 with no other changes anywhere in the universe, and 
that is not allowed. Therefore, your claim that you have an engine more 
efficient than the Carnot engine has to be false. 

The numbers I picked above are simply representative, but you can 
take any set of numbers as long as your engine does better than mine: 
instead of 40 calories of work output, it could have been 30 calories. If it 
produced 30, I’ll get an engine that is 3/2 times as big as the standard Carnot 
engine and run it backward. Your 30 calories will feed my refrigerator, and 
you will find 50 calories of heat flowing from a cold body to a hot body per 
cycle, with no other changes anywhere in the universe. This is how Carnot’s 
engine, even though very primitive, is the standard for all engines. The key 
to the result is that the Carnot engine is reversible. 

The Carnot engine was based on the ideal gas. Given its reversibility, 
we were able to show that no engine could be more efficient. Consider now 
any other reversible engine, running on any substance. We already know 
it cannot be more efficient than Carnot’s. It also cannot be less efficient. 
To show this, repeat our previous argument, but with this reversible one 
running backward, coupled to the Carnot running forward, to produce a 
combination that transfers heat from cold to hot with no other effects. For 
example, if the less efficient but reversible engine took 100 calories and 
converted only 10 to work, you can run it backward and power it using a 
Carnot engine half as big as our illustrative one, which takes in 50 calories 
and delivers 10 calories of work. You should verify that the combination 
will transfer 50 calories from cold to hot in every cycle, with no other ef- 
fects, which is not allowed. The only acceptable result is that all reversible 
engines are equally efficient, Carnot’s being the one that we have studied 
in depth. The efficiency of actual engines will always be less than this limit 
because of losses. 
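The bookkeeping in these coupling arguments is easy to get wrong, so here is a minimal Python sketch of it. The function name heat_pumped_uphill is a hypothetical choice of mine; the numbers are the ones used above (100 calories in, Carnot efficiency 0.2), with exact fractions to avoid rounding.

```python
from fractions import Fraction


def heat_pumped_uphill(q_in, work, carnot_eff=Fraction(1, 5)):
    """Heat moved per cycle (out of cold, into hot) by a rival engine coupled to a Carnot engine."""
    rival_cold = q_in - work          # heat the rival exchanges with the cold reservoir
    carnot_hot = work / carnot_eff    # Carnot rescaled so its work matches the rival's
    carnot_cold = carnot_hot - work
    if carnot_hot >= q_in:
        # Rival claims to beat Carnot: run the rival forward and the Carnot backward.
        return carnot_cold - rival_cold, carnot_hot - q_in
    # Rival is reversible but less efficient: run it backward, Carnot forward.
    return rival_cold - carnot_cold, q_in - carnot_hot


for work in (40, 30, 10):
    out_of_cold, into_hot = heat_pumped_uphill(100, work)
    print(work, out_of_cold, into_hot)
```

In every case the heat leaving the cold reservoir equals the heat entering the hot one, and it vanishes only when the rival delivers exactly 20 calories, that is, when it matches the Carnot efficiency.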


23.4.1 Defining T using Carnot engines 


We have shown that 

$$ \eta = 1 - \frac{T_2}{T_1}, \tag{23.32} $$

where the temperature T is defined by the ideal gas equation PV = NkT. 
We can introduce an absolute temperature T, with no reference to any 
specific substance, using reversible engines as follows. 

We define the ratio of absolute temperatures T1 and T2 of any two 
objects by the relation 

$$ \eta = 1 - \frac{T_2}{T_1}, \tag{23.33} $$

where η is the efficiency of a reversible heat engine operating between T1 
and T2, known to be independent of any details of the engine, the substance 
it runs on, and so forth. Next we fix the absolute values by defining 
T at the triple point of water to be 273.16 K. We are done. 

Suppose you get me a bucket of some fluid. To find its absolute temperature 
T_B, I will run any reversible engine between the bucket of fluid 
(the hot reservoir) and a bucket containing water at its triple point (the 
cold reservoir), measure its η, and obtain T_B by solving 

$$ \eta = 1 - \frac{273.16}{T_B}. \tag{23.34} $$

Here I assume T_B > 273.16. If not, I will switch reservoirs. The engine 
can be made arbitrarily small so as not to affect the temperature of either 
bucket. 
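The recipe amounts to inverting Eqn. 23.34. A minimal Python sketch follows; the function name absolute_temperature and the flag fluid_is_hot are hypothetical, and the measured efficiencies are made-up inputs.

```python
T_TRIPLE = 273.16  # kelvin, temperature assigned to the triple point of water


def absolute_temperature(eta, fluid_is_hot=True):
    """Temperature of the bucket from the measured efficiency of a reversible engine."""
    if not 0.0 <= eta < 1.0:
        raise ValueError("a reversible engine has 0 <= eta < 1")
    if fluid_is_hot:
        return T_TRIPLE / (1.0 - eta)   # eta = 1 - 273.16 / T_B
    return T_TRIPLE * (1.0 - eta)       # reservoirs switched: eta = 1 - T_B / 273.16


print(absolute_temperature(0.25))                      # hotter than the triple point
print(absolute_temperature(0.25, fluid_is_hot=False))  # colder than the triple point
```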

Let us leave the practical domain and go back to more theoretical 
issues that were raised earlier, namely, how Carnot’s result can be used to 
define entropy and how the single law of increasing entropy can succeed in 
forbidding all the processes we said were forbidden, like the unmixing of 
hot and cold. 


CHAPTER 24 


Entropy and Irreversibility 


24.1 Entropy 


As promised at the end of the last chapter, we will now go from Carnot’s 
practical considerations on the efficiency of heat engines to the notion of 
entropy. 

In computing the efficiency of the Carnot engine, we found that 
the heat absorbed from the reservoir at T1 and the heat rejected into the 
reservoir at T2 are in the ratio (consult Figure 23.4) 

$$ \frac{Q_1}{Q_2} = \frac{T_1}{T_2}, \tag{24.1} $$

which we can rewrite as 

$$ \frac{Q_1}{T_1} = \frac{Q_2}{T_2}. \tag{24.2} $$

Note that in this convention, Q1 was the heat absorbed from the hot 
reservoir during AB, while Q2 was the heat rejected in CD into the cold 
reservoir. 

Let us add two harmless 0’s on the left and write 

$$ \frac{Q_1}{T_1} + 0 - \frac{Q_2}{T_2} + 0 = 0 \tag{24.3} $$

so that the four terms correspond to the four parts in the Carnot cycle. The 
part AB contributes Q1/T1 because Q1 was absorbed from the hot reservoir at 
T1. The adiabatic segment BC contributes the first zero. The segment CD 
contributes the −Q2/T2 because Q2 was rejected into the cold reservoir at T2. 
The last adiabatic segment DA gives the final zero. 

Let us change conventions and uniformly define ΔQ_i to be the heat 
absorbed by the system in segment i and rewrite Eqn. 24.3 as follows: 

$$ \sum_i \frac{\Delta Q_i}{T_i} = 0, \tag{24.4} $$

where the sum is over the different segments in the closed loop. 

Eqn. 24.4 is the heart of the whole entropy concept. It tells us there 
is another state variable lurking around besides the energy U. Remember 
a state variable is one that returns to its initial value when we go around 
a closed loop. The internal energy U is a state variable because if you go 
around a cycle and come back to the same point (P, V), U (which is given 
by U = (3/2)PV for an ideal gas) returns to the old value. If the loop is made 
up of some discrete segments labeled by i (as in the Carnot cycle) and dU_i 
is the change in segment i we write 

$$ \sum_i dU_i = 0. \tag{24.5} $$

If the cycle is some arbitrary continuous loop, we write 

$$ \oint dU = 0. \tag{24.6} $$

By contrast, if you add the ΔQ_i’s around a loop, you will get the area 
enclosed by the closed loop, which is the work done by the system in the 
cycle. So Q is not a state variable. 

Now look at Eqn. 24.4. It says that even though the ΔQ_i’s do not add 
up to zero in a cycle, if we divide each ΔQ_i by the value of T_i, the sum 
vanishes. This suggests that we define a new variable, the entropy S of the 
system, as follows. 

The entropy S is a state variable that changes by 

$$ dS = \left.\frac{\Delta Q}{T}\right|_{\rm rev} \tag{24.7} $$

when heat ΔQ is added reversibly, that is, with the system arbitrarily close to 
equilibrium. 

By this definition S returns to its initial value after any Carnot cycle: 

$$ \sum_i dS_i = 0. \tag{24.8} $$


However, to merit the title of state variable Eqn. 24.8 should be valid 
for any closed loop, not just the Carnot cycle bounded by adiabatic and 
isothermal curves. In other words we want 


$$ \oint dS = 0 \tag{24.9} $$

for all loops. This happens to be true. The proof (which you may skip if you 
wish) relies on the fact that you cannot convert heat fully into work with no 
other effects because this means η = 1, in violation of Carnot’s result. 

Now for the proof that over any closed cycle ∮ dS = 0. Imagine the 
system going on a loop as shown in Figure 24.1 and an auxiliary reservoir 
at T0. Take one loop segment labeled i. If during this it needed heat input 
ΔQ_i, let it be delivered by a Carnot refrigerator acting between the reservoir 
at T0 and the system temperature T_i. Let ΔQ_{0i} be the heat extracted 
in this step from the reservoir at T0, and let ΔW_i be the work needed by 
this refrigerator to pump the heat. Being a Carnot refrigerator it obeys 

$$ \frac{\Delta Q_{0i}}{T_0} = \frac{\Delta Q_i}{T_i}. \tag{24.10} $$

Summing this over the closed loop (in parts of which ΔQ_i and ΔQ_{0i} may 
be negative, as in segment j in the figure) we find 

$$ \frac{Q_0}{T_0} = \oint \frac{dQ}{T}, \tag{24.11} $$

where Q0 is the total heat extracted from the reservoir. If Q0 is zero, we 
have the desired result 

$$ \oint \frac{dQ}{T} = \oint dS = 0. \tag{24.12} $$

We will see that this has to be so. (In the expression for dS we use dQ/T 
rather than ΔQ/T because this ratio stands for the differential dS.) 

Figure 24.1 A system is taken quasi-statically through a cycle. In the segment 
labeled i it needs some heat ΔQ_i, which it gets from a Carnot refrigerator 
operating between the system’s current temperature and a reservoir at T0. The 
refrigerator needs work ΔW_i. If it happens that ΔQ_i < 0, as is the case at point j, 
the heat will be sucked out of the system by a Carnot engine and dumped into 
the reservoir at T0. 

If Q0 > 0, it means some heat has been lost by the reservoir. Because 
the system and all the auxiliary Carnot engines have come back to where 
they started, by conservation of energy this heat must have been converted 
fully to an equivalent amount of work delivered by all the Carnot engines 
and our system, that is, with η = 1, which is impossible. If Q0 < 0, we can 
run the whole thing backward (because all steps are reversible in all the 
Carnot engines and our system) and change the sign of Q0 to positive and 
get the previous contradiction. 

Even if you did not follow this argument you must at least note it is 
important that ΔQ has to be added reversibly if we want to relate it to dS 
by ΔQ/T = dS. 

Thus there are now two state variables U and S. Because we have 
only defined the change in S, its absolute value is indeterminate up to a 
constant, just like the potential. Using this freedom we may arbitrarily assign 
a value S0 to the entropy at some point (P0, V0) and find its value at 
any arbitrary point (P, V) experimentally by adding the changes dS = dQ/T 
in going from (P0, V0) to (P, V). We can take any path because each point 
has a well-defined value of S and the difference in S between two points is 
independent of the path joining them. 
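The claim that dQ/T adds up to zero around any closed loop, while dQ itself does not, can be checked numerically. The following Python sketch is only an illustration under stated assumptions: a monatomic ideal gas with U = (3/2)PV, a hypothetical elliptical loop in the P-V plane in arbitrary units, and a simple midpoint rule for each small step.

```python
import math


def loop_integrals(steps=20000, nR=1.0):
    """Return (sum of dQ, sum of dQ/T) around an elliptical loop in the P-V plane."""
    P0, V0, a, b = 2.0, 2.0, 1.0, 1.0   # hypothetical loop, arbitrary units
    total_Q = 0.0
    total_S = 0.0
    for i in range(steps):
        th0 = 2 * math.pi * i / steps
        th1 = 2 * math.pi * (i + 1) / steps
        Pa, Va = P0 + a * math.cos(th0), V0 + b * math.sin(th0)
        Pb, Vb = P0 + a * math.cos(th1), V0 + b * math.sin(th1)
        Pm, Vm = 0.5 * (Pa + Pb), 0.5 * (Va + Vb)
        dU = 1.5 * (Pb * Vb - Pa * Va)   # U = (3/2) P V
        dQ = dU + Pm * (Vb - Va)         # first law: dQ = dU + P dV
        T = Pm * Vm / nR                 # ideal gas law at the midpoint
        total_Q += dQ
        total_S += dQ / T
    return total_Q, total_S


total_Q, total_S = loop_integrals()
print(f"sum of dQ   = {total_Q:.6f}  (area of the ellipse is pi*a*b = {math.pi:.6f})")
print(f"sum of dQ/T = {total_S:.2e}")
```

The first sum reproduces the enclosed area, which is the work done in the cycle; the second is zero up to discretization error, as a state variable requires.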

To summarize, we know S is a state variable and that its change is 
given by Eqn. 24.7. But we have no idea what this quantity S stands for, 
in contrast to ΔQ, dV, or dU, which appeal to our intuition directly. But 
even in the early days people like Rudolf Clausius realized, “Here’s another 
variable we have unearthed. We may not fully know what it means, but it 
is a state variable, so we better take it very seriously.” 

Let us bring S into the picture by rewriting the first law 


$$ dU = \Delta Q - P\,dV \tag{24.13} $$


as follows. Because U is a state variable, the change dU is independent of 
how heat is added to the system. So let us assume it is added reversibly, 
allowing us to write 


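$$ dU = T\,dS - P\,dV. \tag{24.14} $$

Mathematically this tells us U is a function of S and V and that 

$$ T = \left.\frac{\partial U}{\partial S}\right|_V \tag{24.15} $$
$$ P = -\left.\frac{\partial U}{\partial V}\right|_S. \tag{24.16} $$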


Because we are now focused on S, let us rewrite the first law as 

$$ dS = \frac{1}{T}\,dU + \frac{P}{T}\,dV. \tag{24.17} $$

This equation tells us that S (for a fixed amount of gas, say 197 moles) is a 
function of its macroscopic properties: volume V, energy U, and that 

$$ \left.\frac{\partial S}{\partial U}\right|_V = \frac{1}{T}, \qquad \left.\frac{\partial S}{\partial V}\right|_U = \frac{P}{T}. \tag{24.18} $$


These two equations are worth committing to memory. 

Before we develop a feeling for what S means, I want you to get some 
practice calculating the entropy change for a couple of processes armed 
with just 

$$ dS = \left.\frac{dQ}{T}\right|_{\rm rev}. \tag{24.19} $$


First consider m grams of ice that are going to be melted at 0°C. We 
have to add the latent heat L = 80 cal/g reversibly, that is, add a little heat 
from a reservoir at (0 + ε)°C (with ε → 0) and let it be fully absorbed by 
the entire specimen of ice and water till it reaches a new equilibrium state 
with a little more water. And then keep doing this till all the ice has melted, 
at which point 

$$ S_2 - S_1 = \int_1^2 \frac{dQ}{T_{\rm ice}} = \frac{mL}{T_{\rm ice}}\ {\rm cal/K} \quad \text{where } T_{\rm ice} = 273.16\ {\rm K}. \tag{24.20} $$

In this simple case where T is fixed at T_ice, it can be pulled out of the 
integral, and the remaining integral of dQ gives mL. 

Next we consider a more complicated problem: heating m grams of 
water, from T1 to T2. The entropy change is 

$$ S_2 - S_1 = \int_1^2 \frac{mc\,dT}{T} = mc\ln\frac{T_2}{T_1}, \tag{24.21} $$

where c is the specific heat of water. Once again, remember that the heat 
should be added reversibly: you don’t just dump the water at T1 on a saucepan 
at T2; instead you bring it in contact with a succession of reservoirs, each 
infinitesimally above the previous one, to take the water from T1 to T2, 
giving it enough time to equilibrate with each reservoir. 
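To get a feel for the sizes involved, Eqns. 24.20 and 24.21 can be evaluated directly. This is a minimal Python sketch; the function names and the 100-gram sample are hypothetical, and I assume c = 1 cal/(g K) for water.

```python
import math

T_ICE = 273.16   # K, the melting temperature used in the text
L_ICE = 80.0     # cal/g, latent heat of melting
C_WATER = 1.0    # cal/(g K), assumed specific heat of water


def entropy_melting(m_grams):
    """Entropy gained by m grams of ice melted reversibly at T_ICE, in cal/K."""
    return m_grams * L_ICE / T_ICE


def entropy_heating(m_grams, T1, T2, c=C_WATER):
    """Entropy change of m grams taken reversibly from T1 to T2, in cal/K."""
    return m_grams * c * math.log(T2 / T1)


m = 100.0  # grams, a hypothetical sample
print(f"melting:           {entropy_melting(m):.3f} cal/K")
print(f"heating to 373 K:  {entropy_heating(m, T_ICE, 373.16):.3f} cal/K")
print(f"cooling back down: {entropy_heating(m, 373.16, T_ICE):.3f} cal/K")
```

The same function handles cooling: swapping the two temperatures flips the sign of the logarithm.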

If you cooled it from a higher to lower temperature, you can use the 
same formula and the result will be negative because T2 < T1. 

Finally we turn to an entropy calculation involving a gas, whose 
results are going to be very instructive. Take a gas and let it expand isothermally 
at temperature T from volume V1 to V2 as shown in Figure 24.2. 

As when the ice was melted, T is constant and can be pulled out of 
the integral 


$$ S_2 - S_1 = \int_1^2 \frac{dQ}{T} = \frac{1}{T}\int_1^2 dQ = \frac{Q}{T}, \tag{24.22} $$

where Q is the total heat absorbed by the gas. Since U is fixed in this 
isothermal process, Q is the work done by the gas. Thus 

$$ S_2 - S_1 = \int_1^2 \frac{dQ}{T} = \int_1^2 \frac{P\,dV}{T} = \int_1^2 \frac{nRT\,dV}{VT} = nR\ln\frac{V_2}{V_1}. \tag{24.23} $$


Figure 24.2 The entropy change between 1 and 2 at the same T can be 
computed by integrating dQ/T along the isotherm or any other path, such as the 
one that goes via point 0. 

If at every point there is a unique entropy, then the entropy difference 
between 2 and 1 should be independent of how we go from 1 to 2. So, 
instead of following the isotherm let us go straight down the P axis to the 
point 0, which has the same volume as 1 and the same pressure as 2, that 
is, V0 = V1 and P0 = P2. The entropy changes are 

$$ S_0 - S_1 = nc_V\int_{T_1}^{T_0}\frac{dT}{T} = n\frac{3R}{2}\ln\frac{T_0}{T_1} = n\frac{3R}{2}\ln\frac{T_0}{T} \quad (\text{because } T_1 = T) \tag{24.24} $$
$$ S_2 - S_0 = nc_P\ln\frac{T_2}{T_0} = n\frac{5R}{2}\ln\frac{T}{T_0} \quad (\text{because } T_2 = T) \tag{24.25} $$
$$ S_2 - S_1 = n\frac{3R}{2}\ln\frac{T_0}{T} + n\frac{5R}{2}\ln\frac{T}{T_0} = -n\frac{3R}{2}\ln\frac{T}{T_0} + n\frac{5R}{2}\ln\frac{T}{T_0} = nR\ln\frac{T}{T_0} \tag{24.26} $$
$$ = nR\ln\frac{P_2V_2}{P_0V_0} = nR\ln\frac{V_2}{V_1} \quad (\text{upon using } PV = nRT) \tag{24.27} $$

using many equalities implied in Figure 24.2: T1 = T2 = T, V1 = V0, and 
P0 = P2. So, we can find the entropy change any way we like. We usually 
pick the easiest path. 
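The agreement between the two routes can also be confirmed with numbers. Here is a minimal Python sketch for a monatomic ideal gas; the function names and the sample state (1 mole, tripling its volume) are hypothetical choices for illustration.

```python
import math

R = 8.314  # gas constant in J/(mol K)


def ds_along_isotherm(n, V1, V2):
    """Entropy change from 1 to 2 along the isotherm, Eqn. 24.23."""
    return n * R * math.log(V2 / V1)


def ds_via_point_0(n, P1, V1, V2):
    """Entropy change from 1 to 2 through point 0, where V0 = V1 and P0 = P2."""
    T = P1 * V1 / (n * R)            # common temperature of points 1 and 2
    P2 = n * R * T / V2
    T0 = P2 * V1 / (n * R)           # temperature at point 0
    c_v, c_p = 1.5 * R, 2.5 * R      # molar specific heats of a monatomic gas
    leg_1_to_0 = n * c_v * math.log(T0 / T)   # constant volume
    leg_0_to_2 = n * c_p * math.log(T / T0)   # constant pressure
    return leg_1_to_0 + leg_0_to_2


a = ds_along_isotherm(1.0, 0.01, 0.03)
b = ds_via_point_0(1.0, 2.0e5, 0.01, 0.03)
print(f"isotherm: {a:.6f} J/K, via point 0: {b:.6f} J/K")
```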

Remember I told you there are many, many phenomena that seem 
forbidden in our world. A lot of things can happen one way but not 
the other way, and we asked what law might prevent all of them from 
happening. Now I’m ready to state the mega law that enforces all that. 


24.2 The second law: law of increasing entropy 


The second law of thermodynamics says 
$$ dS \ge 0 \quad \text{for the universe.} \tag{24.28} $$


There you have it; that’s the great law: Any process that reduces the entropy 
of the universe is forbidden. 

Now we have to see how this law can forbid the processes that seem 
disallowed. First consider Carnot’s version of the second law, that you can- 
not have a process in which some heat goes from a cold body to a hot 
body with no other changes anywhere. Let’s first calculate the entropy 
change when the two bodies are reservoirs fixed at T1 and T2, briefly 
connected by a conducting rod of negligible mass (whose heat content is 
negligible). Because these are reservoirs they are always in equilibrium and 
have well-defined temperatures, allowing us to invoke 

$$ dS = \left.\frac{\Delta Q}{T}\right|_{\rm rev}. \tag{24.29} $$

So if ΔQ flows from T1 to T2, 

$$ dS = -\frac{\Delta Q}{T_1} + \frac{\Delta Q}{T_2} = \Delta Q\left[\frac{1}{T_2} - \frac{1}{T_1}\right]. \tag{24.30} $$

If T1 > T2 (hot to cold), then dS > 0 and the process is allowed, while if 
T1 < T2 (cold to hot), it is not allowed. Notice that energy is conserved but 
entropy increases: even though the energy gains are equal and opposite, 
the changes in entropy are not. 
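The sign test for heat flow between two reservoirs fits in a few lines. This Python sketch uses a hypothetical function name and made-up temperatures; the heat ΔQ is taken to leave the reservoir at T1 and enter the one at T2.

```python
def entropy_change_reservoirs(dQ, T1, T2):
    """Entropy change of the universe when heat dQ leaves T1 and enters T2."""
    return -dQ / T1 + dQ / T2


hot_to_cold = entropy_change_reservoirs(100.0, T1=400.0, T2=300.0)
cold_to_hot = entropy_change_reservoirs(100.0, T1=300.0, T2=400.0)
print("hot to cold:", hot_to_cold, "allowed" if hot_to_cold > 0 else "forbidden")
print("cold to hot:", cold_to_hot, "allowed" if cold_to_hot > 0 else "forbidden")
```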

Next consider some mass m of copper at some temperature T1 and 
an equal mass of copper at T2 that are isolated from each other. Each has 
a well-defined entropy. Let us now place them in contact, keeping them 
thermally isolated from the rest of the world. They will end up at a common 
temperature T* = (T1 + T2)/2 just by symmetry. Energy, of course, is 
conserved, and that’s in fact how we determine T*. But look at the entropy. 

The naive answer may be that for the combined system dS = 0 follows 
from Eqn. 24.29 because ΔQ = 0, which in turn follows from the fact that 
the system was thermally isolated. This is wrong, because the process was 
not reversible; the system was nowhere near equilibrium when the samples 
at two different temperatures were brought together abruptly. There was a 
period of non-equilibrium when they went off the radar, without even a 
globally defined temperature. 

However, after a long time, when they have settled down to T*, 
they do have well-defined entropies, and we can ask what these final 
entropies are. To find them, we forget about the actual process and in- 
stead imagine taking each block from its initial to its final temperature 
in a sequence of steps never far from equilibrium, during which the 
blocks have well-defined entropies and for which dS could be computed 
using Eqn. 24.29. 

Here is an analogy. Suppose you are on some mountain at some point 
1 with height h1. You hike to a new place 2. I want to know the height 
difference h2 − h1. It is simply the sum of the height differences for every 
step you took. Suppose now that instead of hiking from 1 to 2, you 
used your yogic skills to disappear at 1 and reappear at 2. Even though you 
were invisible during the transit, there is a well-defined height difference 
between your initial and final locations. However, it cannot be found by 
adding up the height changes for each step you took, because you did not 
get from here to there by taking steps; you went off the radar and reap- 
peared. But I can find the height difference between the end points by 
simply walking myself from 1 to 2 and adding up all the height changes. 
I can take any route I want connecting the same two end points. My hike 
is like the reversible path used to compute the entropy change, while your 
yogic journey is like the actual irreversible process. 

So let us imagine the cooler block being steadily heated by putting 
it in contact with a sequence of reservoirs, each a little hotter than the 
previous one, keeping the block never far from equilibrium and slowly 
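bringing it to T*. Similarly, the hotter block is slowly brought down to 
T* = (T1 + T2)/2. The entropy change for the two of them is, in terms of 
the specific heat c: 

$$ S_{\rm final} - S_{\rm initial} = mc\left[\int_{T_1}^{T^*}\frac{dT}{T} + \int_{T_2}^{T^*}\frac{dT}{T}\right] \tag{24.31} $$
$$ = mc\left[\ln\frac{T^*}{T_1} + \ln\frac{T^*}{T_2}\right] = mc\ln\frac{T^{*2}}{T_1T_2}. \tag{24.32} $$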

Let us now confirm that the change in S is positive. At every stage, 
when we brought the hot one down a little by sucking out ΔQ and pushed 
the cold one up a little by pumping in ΔQ, the entropy change was positive 
because we divide the heat gained by a smaller temperature than the heat lost. 

To absolutely establish the positivity of S_final − S_initial we need to show 
that T*² > T1 T2. Is (T1 + T2)²/4 > T1 T2? Since this can be rearranged to 
give “Is (T1 − T2)² > 0?” the answer is of course yes. 
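A quick numerical check of Eqn. 24.32, as a minimal Python sketch. The function name, the block mass, and the temperatures are hypothetical, and the specific heat of copper is taken as roughly 0.092 cal/(g K), an approximate value I am assuming here.

```python
import math

C_COPPER = 0.092  # cal/(g K), approximate specific heat of copper (assumed)


def entropy_change_blocks(m, T1, T2, c=C_COPPER):
    """Total entropy change when two equal blocks at T1 and T2 reach T* = (T1 + T2) / 2."""
    T_star = 0.5 * (T1 + T2)
    return m * c * (math.log(T_star / T1) + math.log(T_star / T2))


for T1, T2 in [(400.0, 300.0), (300.0, 400.0), (350.0, 350.0)]:
    dS = entropy_change_blocks(1000.0, T1, T2)
    print(f"T1 = {T1}, T2 = {T2}: dS = {dS:.4f} cal/K")
```

The result is positive whichever block starts out hotter, and it vanishes only when the two temperatures are already equal.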

So, when hot and cold meet and create lukewarm, the entropy of the 
universe goes up. It follows, therefore, that if lukewarm spontaneously sep- 
arated into hot and cold, the entropy would go down, and that’s why that 
doesn’t happen. That’s why a jar of lukewarm water doesn’t spontaneously 
separate into a cold part and a hot part. Such a separation would not violate 
the law of conservation of energy, but it would violate the law of increas- 
ing entropy. So evolution in one direction is allowed, because entropy goes 
up, but the opposite is not allowed, because that would correspond to a 
spontaneous decrease of entropy. 

Figure 24.3 (panels: State 1, State 2) (Left) The gas is originally restricted to the left half of the thermally 
isolated box by a partition. (Right) When the partition is suddenly removed, 
the gas expands and fills the whole box. If a movie of this is made and played 
backward, the gas will go back to the left half. This is, however, forbidden by the 
second law. 


Finally, consider the following process depicted in Figure 24.3. 

State 1 refers to a gas confined by a partition to the left half of a ther- 
mally isolated box. The gas has reached equilibrium, and it has spread out 
uniformly over this volume. State 1 corresponds to a point in the P-V 
plane. Now, I suddenly remove the partition. There is a period when the 
gas goes off the radar because it is not in a state of equilibrium. Just after 
the partition is removed, the gas doesn’t have a well-defined pressure: it is 
non-zero in the left half of the box; it is zero in the right half, which is still 
a vacuum. That is why the gas has gone off the P-V plane. After you wait 
long enough, the gas is again in equilibrium, in state 2. 

What is the entropy change now? Once again here’s the wrong way to 
do the calculation. You say, “Well, dS = ΔQ/T, but this whole box is thermally 
isolated, and so ΔQ = 0, and the entropy change is zero.” This could not 
possibly be the right answer. If 2 had the same entropy as 1, there should be 
no obstacle to going back to 1 spontaneously. But we know that the process 
2 → 1 will never happen spontaneously: you cannot put the genie back 
in the bottle. 

So the entropy must have gone up under this “free expansion” against 
a vacuum. We should not find the change in entropy by blindly applying 
dS = ΔQ/T because this process was abrupt, and it was far from equilibrium 
in the intermediate stages, whereas the correct formula says the process 
must be reversible, never far from equilibrium: 

$$ dS = \left.\frac{\Delta Q}{T}\right|_{\rm rev}. \tag{24.33} $$

What are we to do? The answer is the same as with the two copper 
blocks: to forget what actually happened and take the system through a 
sequence of reversible steps from the initial equilibrium state to the final 
equilibrium state. 

Because the end points of the gas are very much present as points 
in the P-V plane, we can find the entropy difference by adding up the 
changes in S along any reversible path connecting the end points. 

While any path will do, there is a very simple path we can use because 
the initial and final temperatures are the same. This is because the gas does 
no work against the vacuum into which it is expanding, and it gets no 
heat because it is thermally isolated. So U2 = U1, and being an ideal gas, 
this implies T2 = T1. We may therefore join the end points by an isotherm. 
The entropy change under an isothermal expansion is already known from 
Eqn. 24.23: 

$$ S_2 - S_1 = nR\ln\frac{V_2}{V_1}. \tag{24.34} $$


So in this experiment, the way to calculate the entropy change is to 
forget about what actually happened and do the following instead. Take 
the initial gas, put a piston at the halfway point, apply a counter pressure 
to balance the internal pressure, and slowly let it expand, keeping it all the 
while on a reservoir at T. At every stage the dS for the reservoir cancels 
that for the system because the ΔQ’s are equal and opposite, and their 
temperatures are essentially equal. (I say “essentially” because the reservoir 
had to be infinitesimally hotter than the gas for the gas to absorb heat.) 
In this process, the entropy of the universe does not change at all. The 
entropy gain of the gas balances the entropy loss of the reservoir. But it is 
the entropy gain of the gas that I am after. I used this reversible process as 
a device for finding the entropy gain of the gas alone, because that was the 
same gain it incurred in the irreversible free expansion. 
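The difference between the two processes lies entirely in what happens to the surroundings, which the following Python sketch tries to make explicit. The function name entropy_bookkeeping and the one-mole doubling example are hypothetical.

```python
import math

R = 8.314  # gas constant in J/(mol K)


def entropy_bookkeeping(n, V1, V2, reversible):
    """Return (dS_gas, dS_surroundings, dS_universe) for an ideal gas going from V1 to V2 at fixed T."""
    # S is a state variable, so the gas gains the same entropy in both processes.
    ds_gas = n * R * math.log(V2 / V1)
    # Reversible isothermal expansion: the reservoir loses what the gas gains.
    # Free expansion: the box is isolated, so nothing else changes.
    ds_surroundings = -ds_gas if reversible else 0.0
    return ds_gas, ds_surroundings, ds_gas + ds_surroundings


print("reversible isothermal:", entropy_bookkeeping(1.0, 1.0, 2.0, reversible=True))
print("free expansion:       ", entropy_bookkeeping(1.0, 1.0, 2.0, reversible=False))
```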

This was the same story when two chunks of copper that were at T1 
and T2 were brought together to form a chunk at T* = (T1 + T2)/2. In 
the actual irreversible process the blocks were thermally isolated and the 
entropy of the universe ΔS_U went up, because the universe in this case was 
just the two chunks of copper. Thus in this case, ΔS_U = ΔS_chunks. However, 
to find the ΔS_chunks, we cooked up a reversible process in which a whole 
sequence of reservoirs was called into play, to slowly bring the blocks to 
their final temperatures. Here the entropy gain for the blocks was the same 
as in the original process, but that of the universe was zero: at every stage, 
the entropy change of the reservoirs exactly negated that of the blocks, 
because heat transfer between either chunk and the reservoirs took place 
at essentially the same temperature. 


24.3 Statistical mechanics and entropy 


I am finally ready to explain what entropy stands for at a microscopic 
level according to statistical mechanics, whose key ideas I will now briefly 
introduce. 

Consider the free expansion of a gas, and, in particular, Eqn. 24.34 
for the simple case where the gas expands from some volume V1 = V to 
V2 = 2V. We saw that 

$$ S_2 - S_1 = nR\ln\frac{V_2}{V_1} = nR\ln 2. \tag{24.35} $$

Note that n, the number of moles, is a macroscopic quantity. For example, 
it is the mass of the hydrogen gas in grams or the mass of the carbon gas in 
grams divided by 12. The universal gas constant R ≈ 2 cal/(mole·K) is likewise 
relevant to the macroscopic description. 

By the microscopic description, I mean the one where we are armed 
with the fact that the gas is made of atoms and in particular that we may 
rewrite Eqn. 24.35 as 

$$ S_2 - S_1 = nR\ln\frac{V_2}{V_1} = Nk\ln\frac{V_2}{V_1} = Nk\ln 2 = k\ln 2^N \tag{24.36} $$

where we have used R = N_A k and that the number of atoms is N = nN_A. 
The formula S2 − S1 = nR ln(V2/V1) was discovered long before atoms were 
proven to exist. You recall that the derivation never referred to atoms. 
Equation 24.36 is the first time the number of atoms has entered the for- 
mula for entropy. Rewriting it in terms of N requires the knowledge that 
there are atoms. From this, or by some other means, Boltzmann divined 
the formula for the entropy of a gas in terms of what the individual atoms 
are doing. That is at the heart of statistical mechanics, which gives you the 
microscopic basis of thermodynamics. 

The truly microscopic theory would follow the evolution of every 
atom using Newton’s laws, thereby tracking the state of the gas in maximum
possible detail. In practice, this voluminous data for $10^{23}$ atoms is
neither computable nor digestible, if computed. Here is where statistical
mechanics comes in with a more modest and realizable goal. 

Statistical mechanics focuses on only the following attributes of the 
gas: energy U, volume V, and number of particles N. Despite the micro- 
scopic chaos, with atoms darting about here and there and colliding with 
each other and the walls, these three numbers do not change. They are eas- 
ily measured by macroscopic means, including N, which is the mass of the 
gas divided by the mass of the atoms, presumed known from independent 
microscopic measurements. By contrast, the individual atomic momenta 
and coordinates are perpetually changing. 

Remarkably, the entropy of the gas in equilibrium is determined in sta- 
tistical mechanics solely by these three unchanging macroscopic quantities. 
Boltzmann’s formula for S(U, V, N) is so important that it is carved on 
his gravestone. When we physicists go to Vienna, we skip the orchestras 
and first go to Boltzmann’s grave and read this formula once more with 
reverence. It summarizes a lifetime of work. Here it is: 


$$S(U, V, N) = k \ln \Omega(U, V, N), \qquad (24.37)$$


where $\Omega(U, V, N)$ is the number of different microscopic states or microstates
of the system compatible with its macroscopic properties, namely U, V,
and N. We will verify that it gives the correct entropy change under free
expansion from $V_1$ to $V_2$.

The first step is to count the microstates of the gas. For this we must 
know what they are. A microstate is a collection of data that fully specifies 
the complete and exhaustive state of the gas, using which its future may 
be determined by Newton’s laws. Thus it is the collection of every atomic 
coordinate and every atomic momentum. We need to count the number 
of states in which every atomic coordinate lies inside the box of volume V 
and the momenta are such that the sum of the individual kinetic energies 
adds up to U. 

To follow the upcoming arguments, you need to know this: If a system
has N members (atoms in our example) and each can be in m states,
independent of what the others are doing, the total number of allowed
states for the system is $m \times m \times \cdots \times m = m^N$. For example, a coin can be
only heads or tails (m = 2), and, if we toss 3 of them (N = 3), we can get
$2^3 = 8$ results. If we throw 42 dice, we can get $6^{42}$ results. Make sure you
get this.

For pedagogical reasons the complete dependence of S on N will not
be computed here. But we will find that as long as N is fixed, the partial
derivatives of S with respect to U and V can be evaluated with no error,
and these will confirm beyond any doubt that Boltzmann’s S indeed corresponds
to the one in thermodynamics by reproducing $PV = NkT$ and
$U = \frac{3}{2}NkT$.
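
The counting rule $m^N$ stated above can be checked by brute force for small cases. The following is a minimal Python sketch; the function name `count_states` is an illustrative choice, not something from the text, and four dice stand in for the 42 dice because enumerating $6^{42}$ outcomes is out of the question.

```python
from itertools import product

def count_states(m, N):
    """Count joint states of N independent members with m states each,
    by listing every combination explicitly."""
    return sum(1 for _ in product(range(m), repeat=N))

# Three coins, each heads or tails: compare enumeration with 2^3.
print(count_states(2, 3), 2**3)

# Four dice: compare enumeration with 6^4.
print(count_states(6, 4), 6**4)

# For 42 dice we only use the formula; print how many digits 6^42 has.
print(len(str(6**42)))
```

The enumeration and the formula agree in the small cases, and the digit count of $6^{42}$ gives a feeling for how fast the number of states grows with N.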

First consider the spatial coordinates. Because each atom is point-like 
in our description, its position is a point. If we equate the number of pos- 
sible positions to the number of points inside the volume V, the answer 
will be infinite no matter what V is! So what one does is divide the box 
mentally into tiny cells of volume $a^3$ where a is some tiny number determined
by our desired accuracy in specifying atomic positions in practice.
Let us say we choose $a = 10^{-6}$ m. In a volume V, there will be $V/a^3$ cells
indexed by $i = 1, 2, \ldots, V/a^3$. We label the atoms A, B, and so forth, and we
say in which cell each one is. If A is in cell i = 20 and B in cell i = 98000, et
cetera, that’s one microscopic arrangement. We can assign them to other
cells and in case we permute them, say with A → B → C → D → A, that
is counted as another arrangement (except when two exchanged atoms are
in the same cell). Thus when the gas is restricted by the partition to volume
$V_1$, and each of the N atoms has $V_1/a^3$ possible cell locations,

$$\Omega_1 = \left[\frac{V_1}{a^3}\right]^N, \qquad (24.38)$$

and

$$S_1 = k \ln \left[\frac{V_1}{a^3}\right]^N = Nk \ln \frac{V_1}{a^3}. \qquad (24.39)$$

If, after free expansion, the volume is $V_2$,

$$S_2 = k \ln \left[\frac{V_2}{a^3}\right]^N = Nk \ln \frac{V_2}{a^3}. \qquad (24.40)$$


Notice that S depends on the cell size a. If we change a, we will change 
S by a constant, because of the $\ln a^3$ term. This is unavoidable until quantum
mechanics comes in to specify a unique cell size. However, changes
in S will be unaffected by the varying a, just as adding a constant to the
potential energy does not affect the application of the law of conservation
of energy $K_1 + U_1 = K_2 + U_2$.

But there is another problem with this result. The state of the atom 
is not given by just its location, but also its momentum p. Thus every $\Omega$
above should be multiplied by a factor $\Omega_p(U)$ that counts the number of
momentum states open to the gas at a given value of U. Again one divides
the possible atomic momenta into cells of some size. Whereas the atoms
could occupy any spatial cell in the box independently of the others, now
they can only assume momentum configurations in which the total kinetic
energy of the gas adds up to a given fixed U.

Thus, the formula to use is

$$\Omega = \left[\frac{V}{a^3}\right]^N \times \Omega_p(U) \qquad (24.41)$$

$$S(U, V) = Nk \ln\frac{V}{a^3} + k \ln \Omega_p(U). \qquad (24.42)$$

Luckily we do not need to evaluate $\Omega_p(U)$ in this example because U does
not change during free expansion and so neither does $\Omega_p(U)$. It will drop
out in the difference $S_2(U, V_2) - S_1(U, V_1)$. Later we will compute $\Omega_p(U)$
for the general processes in which U varies as well.

The change in entropy is

$$S_2 - S_1 = Nk \ln\frac{V_2}{a^3} - Nk \ln\frac{V_1}{a^3} = Nk \ln\frac{V_2}{V_1} = Nk \ln 2 \ \text{ in our example} \qquad (24.43)$$

independent of a and in agreement with what we got from thermodynamics,
before we knew there were atoms! However, in earlier times $S_2 - S_1$ was
written in terms of the macroscopic attribute, the number of moles n and
the macroscopic parameter, the universal gas constant R ≈ 2 cal/(mole·K):

$$S_2 - S_1 = nR \ln\frac{V_2}{V_1}. \qquad (24.44)$$

Because a drops out, in computing the change in S, we sometimes
choose the cell size $a^3$ to be half the volume for pedagogical reasons. In
this case each atom has only two positions: left or right. If initially all are
in the left (with the partition in place) there is just one way, $\Omega_1 = 1$, $S_1 = 0$,
while if both choices are open (with the partition removed), $\Omega_2 = 2^N$ and
$S_2 = Nk \ln 2$ and $S_2 - S_1 = Nk \ln 2$ as before.
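
The claim that the cell size a drops out of the entropy change is easy to confirm numerically. The sketch below evaluates the positional entropy $Nk \ln(V/a^3)$ for several cell sizes; the function names and the particular values of N, $V_1$, and $V_2$ (one mole expanding from one liter to two) are assumptions chosen only for illustration.

```python
import math

k = 1.380649e-23  # Boltzmann constant in J/K

def position_entropy(N, V, a):
    """Positional part of Boltzmann's entropy: k ln (V/a^3)^N = N k ln(V/a^3)."""
    return N * k * math.log(V / a**3)

def delta_S(N, V1, V2, a):
    """Entropy change when the volume goes from V1 to V2 at fixed cell size a."""
    return position_entropy(N, V2, a) - position_entropy(N, V1, a)

N = 6.022e23               # about one mole of atoms (illustrative value)
V1, V2 = 1.0e-3, 2.0e-3    # volumes in cubic meters (illustrative values)

# The individual entropies depend on a, but their difference does not.
for a in (1e-6, 1e-8, 1e-10):
    print(a, position_entropy(N, V1, a), delta_S(N, V1, V2, a))

# Compare with N k ln 2, which for one mole is also n R ln 2.
print(N * k * math.log(2))
```

Each choice of a shifts $S_1$ and $S_2$ by the same constant, so the difference stays at $Nk \ln 2$, roughly 5.76 J/K for these numbers.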

If every equilibrium state has a unique entropy (up to a constant) 
what do we mean when we say S is maximized in equilibrium? 

The preceding case of free expansion illustrates the answer to this 
common question. Initially the gas occupies the left half and is in equilibrium.
The entropy assigned is $S_1$. Nothing will change if we leave it alone,
including $S_1$. Now we remove the partition or constraint and wait until
a new equilibrium is reached. The law of increasing entropy says that the
new $S_2$ will be greater than the old $S_1$. Thus the entropy increased, not
when the system was in equilibrium, but when some external conditions 
keeping it there were changed to allow a new equilibrium state. In other 
words, the change in entropy is a result of our lifting some constraint (the 
partition) that made available a new equilibrium state to the system that 
was initially constrained to live in the old equilibrium state. Removing the 
constraint can only open up more options (or keep them the same), which 
is why S either rises or stays put. We will return to this theme shortly. 

Here is another example. Two different gases A and B occupy the two 
halves of a partitioned box. The two halves settle down, reach equilibrium, 
and have some total well-defined entropy $S_{\text{initial}}$. This entropy does not
change with time. Suppose we now remove the constraint, the partition.
The separated gases do not represent an equilibrium state in the absence
of the partition. There will be macroscopic changes as the gases begin to
mix, and there will be a period when S is not defined. Finally the system
will assume a new equilibrium state with both gases uniformly spread out
over the entire box and an entropy $S_{\text{final}} > S_{\text{initial}}$.

So here is what statistical mechanics and in particular $S = k \ln \Omega$ have
done for us.


• Given us a microscopic basis for entropy that is a lot more comprehensible
than $dS = dQ/T$ and that is also capable of reproducing all
thermodynamic results, as illustrated in the case of free expansion.

• Made it clear why S will go up (or stay fixed) in a spontaneous process:
when a constraint is removed more states $\Omega$ become available.

• Explained why A → B happens spontaneously but not B → A: because
the macroscopic state B can be realized in many more ways than state A.

• Given us a deeper and more accurate picture of equilibrium states.
Consider the gas after the partition is removed and it fills the whole box
evenly. This uniform density corresponds to our image of the equilibrium
state in thermodynamics. Statistical mechanics tells us that while
this is typically what happens, small deviations from this will occur with
a calculable probability determined by the following sole postulate of
classical statistical mechanics.


Postulate of statistical mechanics: In equilibrium, every allowed micro- 
scopic state of an isolated system has the same probability. 


It is this postulate that ensures S will go up in a spontaneous
process when a constraint is removed: fewer constraints mean more
microscopic states are allowed.

To illustrate the other implications of this postulate for an ideal 
gas, let us assume every atom has just two positional states: left or right 
half of the box. There are $2^N$ allowed microscopic states and all occur
with the same probability. But this does not mean every possible value
of a macroscopic observable is equally probable. The fraction $f = n/N$ of
particles in the right half is a macroscopic variable. Not every f is equally
likely. Consider the configuration with all the particles in the left half:
n = 0 and f = 0. It can occur in only one way. It is like tossing $10^{23}$ coins
and getting all heads: possible but highly improbable. If you now let
one of the particles be in the right half, that is, n = 1, there are N ways
for this to happen because there are N choices for who gets to be the
odd atom. So this situation is N times more likely than the one with all
in the left half. More generally, the situation with n atoms in the right
and $N - n$ in the left, by elementary combinatorics, can occur in

$$\Omega(N, n) = \frac{N!}{n!\,(N-n)!} \qquad (24.45)$$

ways. Therefore $f = n/N$ is $\Omega(N, n)$ times more likely than f = 0, the one
with all atoms in the left half.

As n increases, so does $\Omega(N, n)$, which reaches a maximum at n = N/2:

$$\Omega = \Omega_{\max} = \frac{N!}{[(N/2)!]^2}. \qquad (24.46)$$

Beyond the halfway point, $\Omega$ falls and reaches the value 1 when n = N.
Figure 24.4 shows $\Omega(30, n)$. This graph allows us to say, for this gas of
30 atoms, what the odds are for any partitioning of atoms between left
and right. If every value of n is allowed, it means that in equilibrium
the density will be inhomogeneous and not be uniquely determined by
U and V. But this is not the situation to which statistical mechanics
is applied. Rather it is intended for a typical gas for which $N \simeq 10^{23}$.
Now the distribution will become very sharply peaked at f = .5 and the
chances of getting anything really different from a 50/50 split will be
negligible. It can be shown that typically f will not deviate from f = .5
by more than a number of order $1/\sqrt{N}$. Thus the variable f will lie in
the range $f = \bar{f} \pm \frac{1}{\sqrt{N}}$. Likewise, deviations from the average $\bar{P}$ will exist
but will be relatively very small, and one may associate $\bar{P}$ with the P in
$PV = NkT$ as $N \to \infty$.

Figure 24.4 The number of microscopic configurations with $N - n$ particles on
the left half of the box and n on the right, for N = 30.

In general the likelihood of obtaining a certain value for a macro- 
scopic observable in equilibrium is proportional to the underlying 
number of microscopic arrangements. 
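
The sharpening of the distribution of $f = n/N$ as N grows can be seen directly by counting. The sketch below uses the binomial count for n atoms in the right half and the equal-probability postulate; the function names `omega` and `prob_within`, and the window of 0.05 around f = 0.5, are illustrative choices rather than anything prescribed by the text.

```python
import math

def omega(N, n):
    """Number of ways to place n of N labeled atoms in the right half."""
    return math.comb(N, n)

def prob_within(N, width):
    """Probability that f = n/N lies within `width` of 0.5 when all
    2^N microstates are equally likely."""
    good = sum(math.comb(N, n) for n in range(N + 1)
               if abs(n / N - 0.5) <= width)
    return good / 2**N

# The N = 30 gas: the counts add up to 2^30 and peak at n = 15.
counts = [omega(30, n) for n in range(31)]
peak = max(range(31), key=lambda n: counts[n])
print(peak, counts[peak], sum(counts) == 2**30)

# Larger gases: the chance of finding f close to 0.5 approaches 1.
for size in (30, 300, 3000):
    print(size, prob_within(size, 0.05))
```

Even at a few thousand atoms nearly all the probability sits within a few percent of an even split, which is the trend that becomes overwhelming for a real gas.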


We now return to our discussion of entropy. It is often said that en- 
tropy is a measure of how disordered your system is. To have all the atoms 
of the two species confined to two different halves of the box is perfectly 
natural if there is a partition to enforce the separation. However, once the 
partition is removed, this condition would be unnatural and extremely 
ordered in the face of the newly opened possibilities. This initial order, 
introduced by design, will quickly and spontaneously evolve into a fully 
mixed state. Thus opening the partition and allowing the gases to mix 
increases the disorder. The technical measure of disorder is the (log of 
the) number of microscopic arrangements that can lead to a given macroscopic
situation. The spontaneous evolution from order to disorder under
the removal of a constraint occurs because there are so many more ways to
be disordered than to be ordered.

Here is another example. You take a container with a vertical par- 
tition, and pour clear water in one side and red water in the other. You 
have an ordered state. It can last forever as long as there is a partition. If 
you remove the partition, the red and clear water will soon mix, because 
there’s no reason for the red particles to stay on one side forever. Eventually 
the mixture will become pink. This disordered configuration is inevitable, 
because there are many more ways to remain pink than to remain sepa- 
rated. Likewise, pink spontaneously separating into red and colorless, the 
emergence of order from chaos, is very, very improbable and points in the 
direction of lower entropy. 

Although the unmixing seems disallowed by the law of increasing 
entropy, it is not strictly disallowed, just overwhelmingly unlikely. For example,
if you are in one side of a room, there is a chance of roughly
one part in $2^{10^{23}}$ that all the air molecules spontaneously end up in the
other half. But don’t hold your breath; it is not likely to happen in the re- 
maining age of the universe. The second law of thermodynamics may be a 
statistical law, but when it says something is “unlikely” it means “so over- 
whelmingly, ridiculously unlikely you should forget about its happening in 
the entire history of the universe.” 

Be aware of one thing: the entropy of a part of the universe can go 
down. It’s just the entropy of the whole universe that cannot go down. All 
of life is an example of lowering entropy: the creation of tomatoes out of 
mud and fertilizer is a highly organizing process where the entropy is really 
going down. But if you keep track of the rest of the world, you will find that 
there is a greater increase of entropy somewhere else. Or take your freezer: 
your refrigerator sucks heat out of your freezer, and the entropy of the 
freezer goes down, but elsewhere in the room there’s a bigger increase due
to the heat emitted from the back of the refrigerator. If you take account 
of everything, the entropy of the universe will go up or remain the same; it 
will never go down. 


24.4 Entropy of an ideal gas: full microscopic analysis 


We will now compute the entropy (up to a constant) of an ideal gas of N 
atoms, energy U, and restricted to a volume V using Boltzmann’s formula. 
We will assume N never changes but that U and V could. So we want 
S(U, V).

The energy of an ideal gas is entirely kinetic and independent of
particle positions. We have to find the logarithm of the number of states
$\Omega(U, V)$ compatible with the prescribed values of U and V.

We have already seen that

$$\Omega(U, V) = \left[\frac{V}{a^3}\right]^N \Omega_p(U) \qquad (24.47)$$

where $V/a^3$ counts the possible positions for each atom and $\Omega_p(U)$ is the
number of microscopic arrangements of momenta for the entire gas that
respect the condition that the total kinetic energy be U. (In computing the
change in S during free expansion we ignored $\Omega_p(U)$ because U was the
same before and after free expansion. Here we want U to vary and so need
to work a little harder and find $\Omega_p(U)$.)

The internal energy is (for the allowed configuration with every atom
inside the box)

$$U = \sum_{i=1}^{N} \frac{1}{2} m |\mathbf{v}_i|^2 = \sum_{i=1}^{N} \frac{|\mathbf{p}_i|^2}{2m} = \sum_{i=1}^{N} \frac{p_{ix}^2 + p_{iy}^2 + p_{iz}^2}{2m} \quad \text{where } \mathbf{p} = m\mathbf{v} \text{ is the momentum.} \qquad (24.48)$$

Let us now form a vector P with 3N components

$$P = (p_{1x}, p_{1y}, p_{1z}, p_{2x}, \ldots, p_{Nz}), \qquad (24.49)$$

which is simply the collection of the 3 components of the N momentum
vectors $\mathbf{p}_i$. If we renumber the components of P with an index
j = 1, 2, ..., 3N,

$$P = (P_1, P_2, \ldots, P_{3N}), \qquad (24.50)$$

that is to say,

$$P_1 = p_{1x},\ P_2 = p_{1y},\ P_3 = p_{1z},\ P_4 = p_{2x},\ \ldots,\ P_{3N} = p_{Nz}, \qquad (24.51)$$

we may write

$$U = \sum_{j=1}^{3N} \frac{P_j^2}{2m}. \qquad (24.52)$$


Regardless of their position, the atoms can have any momentum as 
long as the components satisfy Eqn. 24.52. So we must see how many possible
momenta exist obeying this condition. The condition may be rewritten
as

$$\sum_{j=1}^{3N} P_j^2 = 2mU. \qquad (24.53)$$

This is the equation for a hyper-sphere of radius $R = \sqrt{2mU}$ in 3N
dimensions just as

$$x^2 + y^2 = R^2 \ \text{ is a circle or 1-sphere and} \qquad (24.54)$$

$$x^2 + y^2 + z^2 = R^2 \ \text{ is the usual sphere or 2-sphere.} \qquad (24.55)$$

In mathematical literature the circle in d = 2 and the usual sphere in
d = 3 are both referred to as spheres, and the circumference and area are
both referred to as areas. The areas are $2\pi R$ and $4\pi R^2$ respectively in the
familiar examples. By dimensional analysis, a sphere of radius R in d dimensions
has an area that goes as $R^{d-1}$. In our problem $R = \sqrt{2mU}$,
d = 3N, and $d - 1 = 3N - 1 \simeq 3N$. If we divide the individual momenta into cells
of size $b^3$, which like $a^3$ is small but arbitrary, the total number of states
allowed to the gas behaves as

$$\Omega(V, U) = V^N U^{3N/2} F(m, N, a, b) \qquad (24.56)$$


where we have focused on the dependence on U and V and lumped 
the rest of the dependence on m, a, b, and N in the unknown function 
F(m, N, a, b). We do not need F because we just want to take

$$S = k \ln \Omega = k\left[N \ln V + \frac{3N}{2} \ln U\right] + k \ln F(m, N, a, b) \qquad (24.57)$$

and find its V and U partial derivatives, to which F makes no contribution.
These derivatives are

$$\left.\frac{\partial S}{\partial V}\right|_U = \frac{kN}{V} \qquad (24.58)$$

$$\left.\frac{\partial S}{\partial U}\right|_V = \frac{3kN}{2U}. \qquad (24.59)$$

If we identify the two derivatives above with $P/T$ and $1/T$ following Eqn.
24.18, that is, as they were in thermodynamics, we obtain

$$\frac{kN}{V} = \frac{P}{T} \quad \text{which is just } PV = NkT \qquad (24.60)$$

$$\frac{3kN}{2U} = \frac{1}{T} \quad \text{which is just } U = \frac{3}{2}NkT. \qquad (24.61)$$


Thus we are able to derive these equations of state of the ideal gas from the 
Boltzmann definition of entropy. 
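
The step from Boltzmann's entropy to the equations of state can also be checked numerically, by differentiating S(U, V) with finite differences instead of by hand. In the sketch below the constant $k \ln F$ is dropped since it does not affect the derivatives; the helper names and the sample values of N, V, and U are assumptions made for illustration.

```python
import math

k = 1.380649e-23  # Boltzmann constant in J/K

def entropy(U, V, N):
    """S(U, V) = k [N ln V + (3N/2) ln U], omitting the constant k ln F."""
    return k * (N * math.log(V) + 1.5 * N * math.log(U))

def partial(f, x, h):
    """Central finite-difference estimate of df/dx at x."""
    return (f(x + h) - f(x - h)) / (2 * h)

N = 1.0e22     # number of atoms (illustrative)
V = 1.0e-3     # volume in cubic meters (illustrative)
U = 60.0       # internal energy in joules (illustrative)

dS_dV = partial(lambda v: entropy(U, v, N), V, V * 1e-4)  # identified with P/T
dS_dU = partial(lambda u: entropy(u, V, N), U, U * 1e-4)  # identified with 1/T

T = 1.0 / dS_dU
P = T * dS_dV

print(T, P)
print(P * V, N * k * T)        # PV compared with NkT
print(U, 1.5 * N * k * T)      # U compared with (3/2) NkT
```

The two printed pairs agree to within the accuracy of the finite differences, which is the numerical counterpart of Eqns. 24.60 and 24.61.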

With more work we could get the full N-dependence of $\Omega$ as well. It
has interesting consequences, but I will not go there, leaving it to you to
pursue the topic on your own.

Two final remarks on Boltzmann’s formula. First,

$$S = k \ln \Omega \qquad (24.62)$$

is valid not only for the ideal gas but any thermodynamic system. However,
computing $\Omega$ is generally impossible except for some idealized models.
Next, consider two systems that are independent. Then

$$\Omega = \Omega_1 \times \Omega_2, \qquad (24.63)$$

that is, the number of options open to the two systems is the product of the
numbers open to each. This ensures that the total entropy, S, is additive:

$$S = S_1 + S_2. \qquad (24.64)$$


24.5 Maximum entropy principle illustrated


I have already explained briefly what it means to say that the entropy is 
defined only in equilibrium, and also that it is a maximum in equilibrium. 
I resume that discussion with an illustrative example. 

Imagine a box of gas separated by an insulating partition, with energies
$U_1^0$ and $U_2^0$ in the left and right halves respectively. The initial entropy
of the system is

$$S_{\text{initial}} = k \ln\left[\Omega_1(U_1^0) \cdot \Omega_2(U_2^0)\right] \qquad (24.65)$$


in obvious notation. The two sides may not be at the same temperature, 
but there is nothing they can do about it, because of the insulating bar- 
rier. Suppose we now let the barrier conduct heat, so that energy can flow 
between the two sides. 

Intuitively we expect that energy will flow till the two sides have 
reached the same T. On the other hand, statistical mechanics says the fi- 
nal state will be one of maximum entropy. We will now verify that the two 
conditions are equivalent. 

Let us find out how statistical mechanics determines the final partition
of the total energy

$$U_0 = U_1^0 + U_2^0. \qquad (24.66)$$

The new equilibrium entropy is

$$S_{1+2} = k \ln\left[\sum_{U_1=0}^{U_0} \Omega_1(U_1) \cdot \Omega_2(U_2 = U_0 - U_1)\right] \qquad (24.67)$$

where the sum runs over all possible assignments of $U_1$, from the lowest
value (assumed to be 0) to the maximum, the entire energy $U_0$. The initial
entropy comes from just one term in the sum with $U_1 = U_1^0$ and $U_2 = U_2^0$,
a condition imposed by the barrier.

Without the barrier, all the partitions of the total energy are allowed. 
The final entropy is obviously larger, because all the extra terms in the sum 
are positive. But the crucial point is not that there are more terms, but that 
there are usually some new individual terms that are astronomically larger 
than the initial one $\Omega_1(U_1^0) \cdot \Omega_2(U_2^0)$. The largest term can be found by
maximizing the product $\Omega_1(U_1) \cdot \Omega_2(U_2 = U_0 - U_1)$ with respect to $U_1$.
Let $U_1^*$ be the value of $U_1$ that maximizes the product and let $U_2^* = U_0 - U_1^*$.
Let us maximize the logarithm of the product instead, to find

$$0 = \frac{d \ln\left[\Omega_1(U_1) \cdot \Omega_2(U_2 = U_0 - U_1)\right]}{dU_1} \qquad (24.68)$$

$$= \left.\frac{d \ln \Omega_1(U_1)}{dU_1}\right|_{U_1^*} + \left.\frac{d \ln \Omega_2(U_2)}{dU_2}\right|_{U_2^*} \cdot \frac{dU_2}{dU_1} \qquad (24.69)$$

$$= \frac{1}{kT_1} - \frac{1}{kT_2} \quad \left(\text{because } \frac{dU_2}{dU_1} = -1\right). \qquad (24.70)$$


In other words, the largest term corresponds to the partitioning of energy such 
that the temperatures are equal on the two sides. This confirms the asser- 
tion that the equality of final temperatures when the partition becomes 
conducting is synonymous with maximizing entropy. 
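
The equivalence of maximum entropy and equal temperatures can be illustrated with two ideal gases of different sizes. The sketch below takes $\ln \Omega \propto (3N/2) \ln U$ for each side, scans the possible divisions of the total energy on a grid, and compares the temperatures at the best division. The names `ln_omega` and `best_split`, the grid search itself, and the values of $U_0$, $N_1$, and $N_2$ are all assumptions made for illustration.

```python
import math

def ln_omega(U, N):
    """ln of the number of momentum states of an ideal gas of N atoms,
    up to an additive constant: (3N/2) ln U."""
    return 1.5 * N * math.log(U)

def best_split(U0, N1, N2, steps=20000):
    """Scan U1 over a grid in (0, U0) and return the value that
    maximizes ln[Omega_1(U1) * Omega_2(U0 - U1)]."""
    best_U1, best_val = None, -math.inf
    for i in range(1, steps):
        U1 = U0 * i / steps
        val = ln_omega(U1, N1) + ln_omega(U0 - U1, N2)
        if val > best_val:
            best_U1, best_val = U1, val
    return best_U1

U0, N1, N2 = 4.0, 1000, 3000   # arbitrary energy units and atom numbers
U1 = best_split(U0, N1, N2)
U2 = U0 - U1

# For an ideal gas U = (3/2) N k T, so kT = 2U / (3N) on each side.
kT1 = 2 * U1 / (3 * N1)
kT2 = 2 * U2 / (3 * N2)
print(U1, U2)
print(kT1, kT2)
```

The maximizing division gives each side energy in proportion to its number of atoms, and the two values of kT coincide, as Eqn. 24.70 requires.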

The product $\Omega_1(U_1) \cdot \Omega_2(U_2 = U_0 - U_1)$ drops off very rapidly as we
move away from the maximum at $U_1^*$. So $\Omega_{1+2}$, the sum over partitions,
or “the area under this graph,” is then $\Omega_{\max} \cdot W$, where W is some effective
width, which will be some tiny fraction of $U_0$, the maximum allowed range
for $U_1$. We will see that the details of W will not matter. Continuing,

$$\Omega_{1+2} = \Omega_1(U_1^*)\,\Omega_2(U_2^*) \cdot W \qquad (24.71)$$

$$S_{1+2} = k \ln \Omega_1(U_1^*) + k \ln \Omega_2(U_2^*) + k \ln W \qquad (24.72)$$

$$\simeq S_1(U_1^*) + S_2(U_2^*). \qquad (24.73)$$

We have dropped the ln W term compared to the first two terms, which are
typically of order N ln U (see Eqn. 24.57) with $N \simeq 10^{23}$. It is often the case
in statistical mechanics that the sum over terms is replaced by the largest
term (or the area under the graph is replaced by the maximum height of
the function), with negligible error in the logarithm, which is eventually
and inevitably taken. Physically, it means in the example under consideration
that, even though the total energy can be divided in all possible
ways, you are not likely to find any division that differs macroscopically
from the one at the maximum. Observe that only when we approximate
the sum over partitions with the dominant term (which causes an utterly
negligible error in the logarithm) does the total $\Omega_{1+2}$ become the
product of the individual $\Omega$’s, and the total $S_{1+2}$, the sum of individual
entropies.

Let us get a sense of the numbers involved. Imagine that initially we
had ideal gases with N atoms each on both sides, with the temperature and
energy on the right being three times that on the left:


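$$U_1^0 = U \qquad U_2^0 = 3U. \qquad (24.74)$$

The initial entropy, with the insulating partition is, from Eqn. 24.57,

$$S_{\text{initial}} = \frac{3Nk}{2}\left[\ln\frac{U}{N} + \ln\frac{3U}{N}\right] + \text{terms that are unaffected by the partition.} \qquad (24.75)$$

The final state with equal T and hence energy 2U in each side has entropy

$$S_{\text{final}} = 2 \times \frac{3Nk}{2} \ln\frac{2U}{N} + \text{the same unaffected terms.} \qquad (24.76)$$

The change in entropy is

$$S_{\text{final}} - S_{\text{initial}} = \frac{3Nk}{2} \ln\frac{4}{3}, \qquad (24.77)$$

which means the ratio of allowed configurations is

$$\frac{\Omega_{\text{final}}}{\Omega_{\text{initial}}} = \left[\frac{4}{3}\right]^{3N/2} \simeq 10^{1.87 \times 10^{22}} \qquad (24.78)$$

for $N = 10^{23}$.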
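
The size of this ratio is far beyond what a floating-point number can hold, but its logarithm is easy to compute. The short sketch below evaluates the base-10 exponent of $(4/3)^{3N/2}$; the function name `log10_ratio` is an illustrative choice.

```python
import math

def log10_ratio(N):
    """Base-10 logarithm of Omega_final / Omega_initial = (4/3)^(3N/2)."""
    return 1.5 * N * math.log10(4.0 / 3.0)

# Exponent of 10 for a macroscopic gas with N = 10^23 atoms on each side.
print(log10_ratio(1e23))

# For comparison, the ratio itself for a toy gas with N = 10.
print((4.0 / 3.0) ** (1.5 * 10))
```

For $N = 10^{23}$ the exponent comes out close to $1.87 \times 10^{22}$, in agreement with Eqn. 24.78, while even ten atoms per side already favor the equal-temperature division by a sizable factor.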


You are invited to show that if the conducting partition is also mov- 
able, the maximization of entropy will require that the total volume will be 
shared in such a manner that

$$\frac{P_1}{kT_1} = \frac{P_2}{kT_2}, \qquad (24.79)$$

which reduces to $P_1 = P_2$ because $T_1 = T_2$.


24.6 The Gibbs formalism


Josiah Willard Gibbs was the greatest homegrown physicist produced by 
the United States. He spent his whole life at Yale, where his father was a pro- 
fessor of sacred languages. He obtained his bachelor’s degree and doctorate 
there, joined the faculty, and taught until his death in 1903. He seems to 
have been an extraordinarily modest, generous, and cheerful person, in 
addition to being an exceptionally deep and original physicist. Among his 
many contributions to mathematics, physics, and chemistry, I will single 
out his alternative to Boltzmann’s statistical mechanics. 

Recall that Boltzmann considered an isolated system, with a con- 
served energy U. He postulated equal probability for every allowed 
microscopic state of this energy. The central function was the entropy, 


$$S(U) = k \ln \Omega(U), \qquad (24.80)$$


where $\Omega(U)$ was the number of states available to the system with energy
U. (We hold V and N fixed for this discussion.) When internal constraints
were removed, the systems would try to maximize entropy. The absolute
temperature emerges as a partial derivative:

$$\frac{\partial S}{\partial U} = \frac{1}{T} \quad \text{which can be rewritten as} \qquad (24.81)$$

$$\frac{\partial \ln \Omega}{\partial U} = \frac{1}{kT}. \qquad (24.82)$$


In contrast to Boltzmann, who gave a statistical description of iso- 
lated systems with a definite energy U, Gibbs wanted to describe systems at 
a definite temperature by virtue of being in thermal equilibrium with a heat 
reservoir at a fixed T. For example, the system could be a gas, confined to 
a heat-conducting box, placed inside a gigantic oven at that T. 

Due to the coupling to the reservoir, the energy of the system can 
vary without limit, and the goal now is to find the probability P(i) that the
system would be found in a particular state i, of energy $\varepsilon_i$, and to find the
appropriate version of the law of maximizing entropy.

There is no need for a new postulate. All results will follow from the 
fact that the system plus reservoir is an isolated system, with a fixed total 
energy $U_0$, to which Boltzmann’s treatment applies.

Let $\Omega_R(U)$ be the number of states available to the reservoir when it
has energy U. Let the lowest energy state of the system carry an index 0,
and let us set the corresponding energy $\varepsilon_0 = 0$ for convenience. When the
system is in the one particular state 0, the reservoir has the entire energy $U_0$,
and it can be in any one of $\Omega_R(U_0)$ states. Thus the system plus reservoir
can be in one of $1 \times \Omega_R(U_0)$ states. If the system is in one particular state i
of energy $\varepsilon_i > 0$, the number of states available to the reservoir plus system
is now reduced to $1 \times \Omega_R(U_0 - \varepsilon_i)$. Since every state of the joint system
is equally probable, the ratio of probabilities for the system to be in state i
versus state 0 is simply the ratio

$$\frac{P(i)}{P(0)} = \frac{1 \times \Omega_R(U_0 - \varepsilon_i)}{1 \times \Omega_R(U_0)}. \qquad (24.83)$$


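Let us take the logarithm of both sides and manipulate as follows:

$$\ln\left[\frac{P(i)}{P(0)}\right] = \ln[\Omega_R(U_0 - \varepsilon_i)] - \ln[\Omega_R(U_0)] \qquad (24.84)$$

$$= -\left.\frac{\partial \ln \Omega_R(U)}{\partial U}\right|_{U_0} \varepsilon_i + \frac{1}{2}\left.\frac{\partial^2 \ln \Omega_R(U)}{\partial U^2}\right|_{U_0} \varepsilon_i^2 + \ldots \qquad (24.85)$$

$$= -\frac{\varepsilon_i}{kT} + \ldots \qquad (24.86)$$

In going from Eqn. 24.85 to Eqn. 24.86, I have recalled the definition of
the temperature T of the reservoir (Eqn. 24.82),

$$\frac{\partial \ln \Omega_R(U)}{\partial U} = \frac{1}{kT}, \qquad (24.87)$$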
oU kT 


and dropped all but this first derivative in the Taylor series. The second and 
higher derivatives are successive derivatives of 1/kT, and they vanish since 
the T of a reservoir, by definition, remains fixed no matter what energy 
the system has. (It is like when you stick a thermometer in your mouth to 
measure body temperature, the drop in body temperature is negligible.)
Remember, we do not require the system itself to be small, only that the
reservoir be overwhelmingly larger.

Taking antilogarithms of both sides of Eqn. 24.86, we find the ratio
of probabilities to be

$$\frac{P(i)}{P(0)} = e^{-\varepsilon_i/kT}. \qquad (24.88)$$
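
How large must the reservoir be before the higher terms in the Taylor series can be dropped? One way to get a feeling for this is to model the reservoir as an ideal gas of $N_R$ atoms, so that $\Omega_R(U) \propto U^{3N_R/2}$, and compare the exact ratio of Eqn. 24.83 with the Boltzmann factor. The ideal-gas reservoir, the function names, and the choice $\varepsilon = 2kT$ in the sketch below are assumptions made purely for illustration.

```python
import math

def ratio_exact(eps, U0, NR):
    """Omega_R(U0 - eps) / Omega_R(U0) for a reservoir with
    Omega_R proportional to U^(3 NR / 2), i.e. (1 - eps/U0)^(3 NR / 2)."""
    return math.exp(1.5 * NR * math.log1p(-eps / U0))

def boltzmann(eps, kT):
    """The Boltzmann weight exp(-eps / kT)."""
    return math.exp(-eps / kT)

kT = 1.0     # measure energies in units of the reservoir's kT
eps = 2.0    # energy of the system state, in the same units

for NR in (10, 1000, 100000, 10000000):
    U0 = 1.5 * NR * kT   # reservoir energy that gives this kT (U = 3/2 N k T)
    print(NR, ratio_exact(eps, U0, NR), boltzmann(eps, kT))
```

With only ten atoms the reservoir is noticeably cooled by handing over the energy and the two numbers differ; as $N_R$ grows the exact ratio settles onto $e^{-\varepsilon/kT}$.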


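From the ratio of probabilities we may construct an absolute probability
P(i) (which gives 1 when summed over all values of i):

$$P(i) = \frac{e^{-\varepsilon_i/kT}}{Z} \quad \text{where} \qquad (24.89)$$

$$Z = \sum_i e^{-\varepsilon_i/kT}. \qquad (24.90)$$

We call Z = Z(T) the partition function and $e^{-\varepsilon_i/kT}$ the Boltzmann weight.
Many interesting quantities can be deduced from Z(T). For example,
the average energy is

$$\bar{U} = \sum_i \varepsilon_i P(i) = \frac{\sum_i \varepsilon_i e^{-\varepsilon_i/kT}}{Z} \qquad (24.91)$$

$$= \frac{1}{Z}\, kT^2 \frac{\partial}{\partial T} \sum_i e^{-\varepsilon_i/kT} = kT^2 \cdot \frac{1}{Z} \cdot \frac{\partial Z}{\partial T} \qquad (24.92)$$

$$= kT^2 \frac{\partial \ln Z(T)}{\partial T}. \qquad (24.93)$$

If we know Z(T) in closed form, we can extract the average energy by differentiation.
It is a lot easier to compute Z(T) than $\Omega(U)$, because the sum
over i is unrestricted in energy.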
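
For a system with a handful of states, all of this can be carried out explicitly. The sketch below builds Z, the probabilities P(i), and the average energy in two ways: directly as $\sum_i \varepsilon_i P(i)$ and from the derivative of ln Z. The four energy levels are hypothetical, energies are measured in units where k = 1 (so the variable `kT` plays the role of T), and the derivative is taken by a finite difference.

```python
import math

def partition_function(levels, kT):
    """Z = sum over states of exp(-eps_i / kT)."""
    return sum(math.exp(-e / kT) for e in levels)

def probabilities(levels, kT):
    """P(i) = exp(-eps_i / kT) / Z for each state."""
    Z = partition_function(levels, kT)
    return [math.exp(-e / kT) / Z for e in levels]

def mean_energy_direct(levels, kT):
    """Average energy as sum_i eps_i P(i)."""
    return sum(e * p for e, p in zip(levels, probabilities(levels, kT)))

def mean_energy_from_Z(levels, kT, h=1e-5):
    """Average energy as (kT)^2 d(ln Z)/d(kT), by central difference.
    With k = 1 this is the same as k T^2 d(ln Z)/dT."""
    dlnZ = (math.log(partition_function(levels, kT + h))
            - math.log(partition_function(levels, kT - h))) / (2 * h)
    return kT**2 * dlnZ

levels = [0.0, 1.0, 2.0, 5.0]   # hypothetical energy levels, arbitrary units
kT = 1.5

print(probabilities(levels, kT))
print(sum(probabilities(levels, kT)))
print(mean_energy_direct(levels, kT), mean_energy_from_Z(levels, kT))
```

The probabilities sum to 1 and fall off with energy, and the two routes to the average energy agree to within the accuracy of the numerical derivative.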

As an illustration, consider an ideal gas in a box of volume V, in 
contact with a reservoir of temperature T. Choose as the system just one 
atom, and consider the rest of the gas and the reservoir as part of a new 
reservoir. (The Gibbs approach applies to systems of any size, as you may
verify by going over the derivation above.) Its partition function is

$$Z_1(T) = \frac{1}{a^3} \int_{\text{box}} dx\,dy\,dz \cdot \frac{1}{b^3} \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} dp_x\,dp_y\,dp_z\, \exp\left[-\frac{p_x^2 + p_y^2 + p_z^2}{2mkT}\right] \qquad (24.94)$$

$$= V T^{3/2} f(m, k, a, b) \qquad (24.95)$$

where f(m, k, a, b) is a function that will not affect the mean energy since
it is T-independent. In performing the p-integrals, I introduced rescaled
variables $w_x = p_x/\sqrt{T}$ et cetera, and wrote each p-integral as $T^{1/2}$ times
an integral over the corresponding w that did not depend on T. Now we
see that for one atom,

$$\bar{U}_1 = kT^2 \frac{\partial \ln Z_1(T)}{\partial T} \qquad (24.96)$$

$$= kT^2 \frac{\partial \left[\ln T^{3/2} + \ln V + \ln f(m, k, a, b)\right]}{\partial T} \qquad (24.97)$$

$$= \frac{3}{2} kT. \qquad (24.98)$$

For an ideal gas with N non-interacting atoms,

$$\bar{U}_N = \frac{3}{2} NkT. \qquad (24.99)$$

In Gibbs’s approach, T is fixed, $U_N$ can fluctuate, and $\bar{U}_N$ is its average. It
can be shown that as $N \to \infty$, deviations from $\bar{U}_N$ are negligible compared
to $\bar{U}_N$. It is this average, with negligible (relative) fluctuations, that plays
the role of the internal energy U in thermodynamics, for which statistical
mechanics provides a microscopic foundation.
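
The one-atom result can be verified without doing the Gaussian integral by hand. The sketch below evaluates one momentum integral numerically with the trapezoid rule, assembles $\ln Z_1$ up to T-independent constants, and differentiates it numerically. Units with k = 1 and m = 1, the function names, the integration range of twelve standard widths, and the step sizes are all assumptions made for illustration.

```python
import math

def momentum_integral(m_kT, n=4001):
    """Trapezoid estimate of the integral of exp(-p^2 / (2 m kT)) over all p.
    The range is cut off at 12 widths, where the integrand is negligible."""
    sigma = math.sqrt(m_kT)
    lo, hi = -12.0 * sigma, 12.0 * sigma
    dp = (hi - lo) / (n - 1)
    total = 0.0
    for i in range(n):
        p = lo + i * dp
        weight = 0.5 if i in (0, n - 1) else 1.0   # trapezoid end-point weights
        total += weight * math.exp(-p * p / (2.0 * m_kT))
    return total * dp

def ln_Z1(kT, m=1.0, V=1.0):
    """ln Z_1 up to T-independent constants: ln V + 3 ln(one momentum integral)."""
    return math.log(V) + 3.0 * math.log(momentum_integral(m * kT))

def mean_energy(kT, h=1e-4):
    """(kT)^2 d(ln Z_1)/d(kT) by central difference (Eqn. 24.96 with k = 1)."""
    return kT**2 * (ln_Z1(kT + h) - ln_Z1(kT - h)) / (2.0 * h)

# One momentum integral should equal sqrt(2 pi m kT).
print(momentum_integral(1.0), math.sqrt(2.0 * math.pi))

# The mean energy of one atom should track (3/2) kT.
for kT in (0.5, 1.0, 2.0):
    print(kT, mean_energy(kT), 1.5 * kT)
```

Each momentum integral scales as $T^{1/2}$, so three of them give the $T^{3/2}$ of Eqn. 24.95 and the numerical derivative reproduces $\frac{3}{2}kT$ at every temperature tried.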

The maximum entropy principle applied to the system and the reser- 
voir may be expressed in terms of the system alone as follows: When 
internal constraints are removed in a system in equilibrium with a reser- 
voir at some T, the system evolves to minimize its free energy F(T) defined 
as follows: 


$$F(T) = -kT \ln Z. \qquad (24.100)$$

I will not prove this here, and I hope that you are now motivated to pursue
the subject on your own.


24.7 The third law of thermodynamics 


We have no time to delve into the third law of thermodynamics, which 
declares that the entropy of all systems approaches 0 as T — 0. You will 
realize that quantum mechanical considerations must have gone into this 
assertion, because entropy in classical statistical mechanics is defined only 
up to a constant. In quantum theory, the allowed states of a system are dis- 
crete and countable. As T — 0, we see from Eqn. 24.88 that any state with 
energy higher than the ground state (chosen to have £o = 0) has vanishing 
probability. The system has only one accessible quantum state, which en- 
sures that S = 0. This law is definitely true for all systems of finite extent. 
Occasionally, infinite systems could have multiple ground states. Physical 
considerations show that if the system is found in one of these ground 
states, it will never evolve into any of the others. 
Exercises


Note to instructor: I have only given a few hints and want the problems to 
challenge the very best. Please give additional hints as necessary to suit your class. 


PROBLEM SET 1, FOR CHAPTER 1 


Exercise 1.1 A particle moves with velocity v(t) = 81° — 67. (i) How far does it 
travel between t = 1 and t = 2? (ii) What is the average velocity in this interval? 
(iii) When is the acceleration most negative and what is that value? (iv) Find the 
average a in the first two seconds. 


Exercise 1.2 A particle moves as per $x(t) = 5 - 10t + 5t^2$. (i) Find its average
velocity v between t = 0 and t = 1 and (ii) between t = 0 and t = 2. (iii) Does it 
ever go to x < 0? (iv) Confirm the answer by rewriting x(t) as a perfect square. 


Exercise 1.3 A particle moves as per the equation $x(t) = 3 + 4t + 6t^2 + 4t^3$
where x is in meters and t in seconds. (i) Find its velocity and acceleration for
all times. (ii) When does its velocity ever equal 12 m/s? (iii) At that instant what is its
acceleration? 


Exercise 1.4 (i) Show that if a particle is undergoing uniform acceleration, its
average velocity over a time t may be written as $\bar{v} = \frac{v_f + v_i}{2}$ or as $\frac{x_f - x_i}{t}$, where f and
i refer to final and initial values. (ii) If a is not uniform, which definition should
be used and why?


Exercise 1.5 A particle travels at velocity v(t) = 3¢? for 1s and then at v(t) = 3e7* 
forever. How far does it travel? 


Exercise 1.6 A rock falls toward a static planet with velocity v(x) = 3 @ where 
x is their separation. If initially the rock was at x = 0 and the planet at x = 10°m, 


when does the rock hit the planet? Hint: Split dx and dt. 


Exercise 1.7 A three-person relay has the runners going 30m each at velocities
6 m/s, 5 m/s, and 10 m/s. What is $\bar{v}$, their average speed?
Exercise 1.8 A particle travels a distance L in two ways. (i) First it travels half the
distance at $v_1$ and the other half at $v_2$. What is the average velocity v? (ii) Suppose
it travels half the time at $v_1$ and half the time at $v_2$. What is v? (iii) Take the difference
and show that one average is definitely bigger. (Assume all v’s are positive.)
Explain. 


Exercise 1.9 A three-person relay has the runners going equal distances at 
velocities v1, v2, and v3. What is the average velocity? Apply this to Exercise 1.7. 


Exercise 1.10 An object is dropped from rest, and its position is measured at 
times 0, $\Delta$, $2\Delta$, ..., etc. Show that the ratio of the distances it travels in any two
successive intervals is the ratio of successive odd integers. 
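
A quick way to see the pattern before proving it is to tabulate the distances covered in equal time steps. The sketch below uses exact rational arithmetic; the values chosen for g and for the time step are arbitrary assumptions and cancel out of the ratios, and the function name is hypothetical.

```python
from fractions import Fraction


def interval_distances(count, g=Fraction(98, 10), delta=Fraction(1, 2)):
    """Distances fallen in successive intervals of length delta, starting from rest."""
    positions = [Fraction(1, 2) * g * (n * delta) ** 2 for n in range(count + 1)]
    return [positions[n + 1] - positions[n] for n in range(count)]


distances = interval_distances(5)
# Each distance divided by the first one.
print([d / distances[0] for d in distances])
```

This is a numerical check, not the proof the exercise asks for; the proof follows from the difference of successive squares.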


Exercise 1.11 Bond and Scaramanga get into a helicopter which begins to accelerate
upward at $a$ m/s$^2$ for 3s at which point Scaramanga gets thrown out. He bites
the dust 4.92s after liftoff. (i) What was a? (ii) From what height did Scaramanga 
get ejected? 


Exercise 1.12 I drop a stone into a well and hear a splash T seconds later. Show
that the depth of the well h is the solution to $\sqrt{2h/g} + h/v_s = T$, where $v_s$ is the velocity
of sound. Solve for h by setting $y = \sqrt{h}$ and finding y first. Crunch the numbers if
T = 2s, $v_s$ = 343 m/s.
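
The substitution turns the equation for the depth into a quadratic in the square root of h. A minimal sketch of that route is below, taking T = 2 s, a sound speed of 343 m/s, and g = 9.8 m/s² as sample values; the function names are hypothetical and the numbers are assumptions for illustration.

```python
import math


def well_depth(total_time, sound_speed, g=9.8):
    """Solve sqrt(2h/g) + h/v_s = T for h via the substitution y = sqrt(h)."""
    # In terms of y the equation is the quadratic y**2 / v_s + y * sqrt(2/g) - T = 0.
    a = 1.0 / sound_speed
    b = math.sqrt(2.0 / g)
    c = -total_time
    y = (-b + math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)  # keep the positive root
    return y * y


def residual(depth, total_time, sound_speed, g=9.8):
    """Fall time plus sound travel time, minus the measured delay."""
    return math.sqrt(2.0 * depth / g) + depth / sound_speed - total_time


h = well_depth(2.0, 343.0)
print(h, residual(h, 2.0, 343.0))
```

Only the positive root of the quadratic is physical, since y is a square root; the residual confirms that the returned depth satisfies the original equation.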


Exercise 1.13 From the top of a building of height h = 100m I throw a stone 
up with velocity 10 m/s. (i) What is the maximum height it reaches? (ii) When is
that? (iii) How many seconds does it spend on its way down between h = 50m and 
h= 0m? (iv) What is its velocity when h = 50m? (v) If when it is airborne I quickly 
dig a hole 50m deep, when and with what speed will it hit the bottom? 


Exercise 1.14 Jerry throws a loaf of bread to George, who is in front of an open 
window whose lower edge is at height H and whose upper edge is at H + h. 
(i) What should be the initial upward speed of the bread if it is to just make it 
to the bottom of the window? (ii) If it is to just make it to the top of the window? 
(iii) How long does George have to grab the loaf in the second case? 


Exercise 1.15 A car going at 50 i sees a speed limit sign Vmax = 30 a at a distance 
of 100m. What is the minimum deceleration for compliance? 


Exercise 1.16 A car traveling at v = 25 decelerates at a = =55 ™. (i) How far 


does it travel before coming to rest? (ii) How long does that take? 
Exercise 1.17 A car starts from rest, accelerates at a = 13 for 500m, and decelerates
at the opposite rate for the next 500m. (i) What is its maximum velocity?
(ii) What is the total travel time? 


Exercise 1.18 A plane lands at speed $v_0$ and decelerates uniformly and comes
to rest after time T. How far did it travel on the runway? Do the numbers for 
vo = 300", T = 60s. 


Exercise 1.19 A pilot traveling at speed vp notices a parked truck a distance d 
away on the runway. He wants to lift off before crashing into the truck by reaching 
Vm, the minimum speed for takeoff. (i) What is a, the minimum acceleration he 
needs? (ii) Crunch the numbers for vp = 200, d= 2,0007, and Vin in = 3008, 


Exercise 1.20 Two cars are approaching each other at speeds vp and are a dis- 
tance d apart. What minimum deceleration will avoid a collision? Hint: Where 
should they come to a stop in the extreme case? 


Exercise 1.21 Romeo is at $x_R = 0$ at t = 0 when he sees Juliet at $x_J = 6$m.
(i) He begins to run toward her at $v_R = 5$ m/s. She in turn begins to accelerate
toward him at $a_J = -2$ m/s$^2$. When and where will the lovers cross? Sketch their
motions, measuring time horizontally and position vertically. (The quadratic
equation $x_R(t) = x_J(t)$ has only one solution.) (ii) Suppose instead she moved
away from him with positive acceleration a. Find $a_{max}$, the maximum a for which
he will ever catch up with her, and the time $t_c$ of their contact. Interpret the result
$t_c = v_R/a$ geometrically as the condition for $x_J(t)$ to be tangent to $x_R(t)$ at $t = t_c$.
Show that for smaller values of acceleration a they will cross twice. Draw a sketch
and explain in words why they cross twice. (The quadratic equation $x_R(t) = x_J(t)$
has either no solution for $a > a_{max}$ or two for $a < a_{max}$, which coalesce for $a = a_{max}$.
This is the condition for $x_J(t_c)$ to be tangent to $x_R(t_c)$.)


Exercise 1.22 A rock is launched upward from a height H with speed $v_0$. After
T seconds a second rock is dropped from rest and both hit the ground at the same 
time. What is T? 


Exercise 1.23 Ball A is dropped from rest from a building of height H exactly
when ball B is thrown up vertically with speed $v_0$. When they collide A has double
the speed of B. If the collision occurs at height h, what is h/H? Hint: Write
equations for heights $y_A$, $y_B$ and velocities $v_A$, $v_B$. Solve for $t_c$, the collision time.
Use $|v_A| = 2v_B$ at $t_c$ to show that $H = 2v_0^2/3g$. Use these results to find h/H.


Exercise 1.24 Consider two particles with positions $x_1(t) = 3t^2 + 4t - 9$ and
$x_2(t) = 4t^2 - 5t + 5$. Find the time(s) when they meet, the meeting points, and
their velocities at the meeting points. 
Exercise 1.25 Consider two particles with positions $x_1(t) = 3t^2 + 4t - 9$ and
$x_2(t) = 4t^2 + wt + 7$. (i) Find value(s) w such that they meet only once. (ii) Argue
geometrically that at any such meeting they must have equal velocities. Find the 
common velocity vc at the meeting. 


PROBLEM SET 2, FOR CHAPTER 2 
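Exercise 2.1 Show that if A + B is perpendicular to A - B, then A = B (that is, the two vectors have equal lengths). Show that
$\frac{d(A \cdot B)}{dt} = A \cdot \frac{dB}{dt} + \frac{dA}{dt} \cdot B$.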


Let A = 3i + 4j and B = 5i - 6j. (i) Find A + B, A - B, 2A + 3B, and C such that
A+B+C=0. (ii) Find A, the length of A, and the angle it makes with the x-axis. 


Exercise 2.2 Consider the primed axis rotated relative to the unprimed by an
angle $\phi$ in the counterclockwise direction and the relation


$A_x = A_x' \cos\phi - A_y' \sin\phi$
$A_y = A_y' \cos\phi + A_x' \sin\phi$


that expresses unprimed components in terms of primed components of a vector. 
Show that the dot product of any two vectors is unaffected by the rotation. 


Exercise 2.3 A particle is located at r(t) = 14ti + 6t°j. (i) Find its position, ve- 
locity, and acceleration at t = 2s. (ii) Find its average velocity and acceleration for 
the first two seconds. 


Exercise 2.4 I am passing a train 800m long traveling at 20 m/s in my car traveling
at 30 m/s in the same direction. (i) How long does it take me to pass it?
(ii) What if I am traveling in the opposite direction? 


Exercise 2.5 A train is moving with velocity $v_{TG} = 3i + 4j$ relative to the ground.
A bullet is fired in the train with velocity $v_{BT} = 15i - 6j$ relative to the train. What
is the bullet’s velocity $v_{BG}$ relative to the ground?


Exercise 2.6 A boat B that can travel at 5 km/h on still water is trying to cross a river
R of width 300m flowing at 3 km/h and reach the point directly opposite the starting
point on the shore S. Use $v_{BS} = v_{BR} + v_{RS}$, draw the figure, and find the angle $v_{BR}$
makes with $v_{BS}$ and the time to cross.


Exercise 2.7 What is the radial acceleration of a geosynchronous satellite orbiting
at $R = 4.2 \times 10^7$m from the center of the earth?
Exercise 2.8 Estimate the acceleration of the moon toward the earth given that
it orbits in 28 days at a distance of a quarter of a million miles. 


Exercise 2.9 A jet pilot diving vertically down at 600 mm wants to make a quarter 
turn, experiencing a maximum acceleration of 5g. (i) What is the minimum height 
at which the turn should begin? Assume that the trajectory is circular and the 
speed is constant and that after the quarter turn the plane is moving horizontally 
at ground level. (ii) What force does the seat exert on the pilot just before the 
horizontal part? 


Exercise 2.10 What is the centripetal acceleration of a point on a flywheel of 
radius 1m rotating at 6,000 rpm? If a piece from the edge broke off, how far would 
it travel before hitting the ground, which is 2m below? 


Exercise 2.11 A projectile is shot horizontally from a height of 450m at a speed 
vy = 250%. (i) When does it hit the ground? (ii) Where? (iii) What is v,/v, at 
impact? 


Exercise 2.12 Consider a particle shot at velocity $v_0$ at an angle $\theta$ above the horizontal.
Show that $\tan\phi = \frac{1}{2}\tan\theta$, where $\tan\phi$ is the slope of the line joining the
initial point to the highest point.


Exercise 2.13 Consider a particle shot at velocity vp at an angle 0 above the hor- 
izontal. (i) Find x and y when 2 = tan y. (ii) Look at your answer for y/x and see 


if it agrees with what is expected at the beginning, middle, and end of the flight. 


Exercise 2.14 A shooter wants to hit a target R meters away and at the same 
height by firing a bullet at speed vo. (i) At what height h above the target should 
he aim? (ii) Simplify in the small angle limit. (iii) Do the numbers for $v_0$ = 250 m/s,
R = 100m.


Exercise 2.15 At a wedding the bride throws her bouquet from height 2m with 
a speed $v_0$ = 25 m/s at an angle 37° ($\sin 37° \approx 3/5$) above the horizontal. It is caught
by a friend at height 1.5m. (i) How long is the bouquet in flight and how far did
it go horizontally? (ii) At what time $t^*$ was its height at a maximum and what was
this height? 


Exercise 2.16 A baseball launched horizontally at speed vo fell by .88m while 
traveling 17m horizontally. (i) What was vo? (ii) At what (small) angle above hori- 
zontal should it have been launched at the same speed to arrive at the same height 
17m away? 
Exercise 2.17 At what speed $v_0$ should a rock be thrown at a 45° angle above
horizontal if it is to just clear a wall 15m high at a distance of 20m? After how long 
does it clear the wall? 


Exercise 2.18 When a high speed train suddenly begins to decelerate at —55, a 
kid sleeping on the upper bunk 3m above the floor falls off. Where does he land? 


Exercise 2.19 In a train moving at constant velocity a ball thrown up will come 
right down to the person’s hand. If the train has an acceleration a, show that 
for this to happen it must be thrown at angle $\theta$ from the vertical in the forward
direction, where $\tan\theta = a/g$.


Exercise 2.20 I want a projectile launched at speed v at angle $\theta$ above the horizontal
to just make it to the top of a building of height h a distance d away. What
$\theta$ should I use?


Exercise 2.21 A batter hits a ball at a speed $v_0$ and an angle $\theta$ above the horizontal.
At that instant a fielder $x_0$ meters from him along the line of flight runs to
catch the ball at a fixed speed w. (i) Find the w that will put him right at the ball 
when it lands. (Ignore the height of the fielder and batter.) (ii) Does the answer 
change if both have the same height? 


Exercise 2.22 Recall that for a given launch speed $v_0$ the range is the same for
launch angles $\theta$ and $\theta' = \frac{\pi}{2} - \theta$. Find the ratio of the flight times for the two
options. 


Exercise 2.23 A projectile is shot with speed $v_0$ at an angle $\theta$ above the horizontal
from a cannon at the foot of a hill of slope $\alpha$. (i) What is the range R (measured
along the hill)? Hint: What is the relation between x and y when it hits the hill?
(ii) Show that for R to be a maximum you need $\cos 2\theta + \sin 2\theta \cdot \tan\alpha = 0$ or
$\theta = \frac{\pi}{4} + \frac{\alpha}{2}$.


Exercise 2.24 A monkey is hanging from a height h, and a person at a horizontal 
distance d meters away from the tree and on the ground wants to zap it with a 
tranquilizer dart. He aims straight at the monkey and fires, and the monkey lets 
go at the same instant. Show that the person will succeed, and find the monkey’s y 
when it is hit. Show that the initial speed must obey $v_0 > \sqrt{(d^2 + h^2)g/2h}$ for this
to be an acceptable solution. What does this condition ensure? Hints: Write the 
equations for x and y coordinates of dart and monkey. Show that when the dart 
hits the tree, dart and monkey have the same y coordinate. The condition on vo 
ensures that the dart can reach the tree before the monkey hits the ground. 
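
The claim in the hints can be checked numerically before it is derived. The sketch below follows the dart until it reaches the tree and compares its height with the monkey's at that instant. The sample distances, the value g = 9.8 m/s², and the function name are assumptions chosen for illustration only.

```python
import math


def dart_and_monkey(d, h, v0, g=9.8):
    """Return (t_hit, y_dart, y_monkey) at the moment the dart reaches the tree at x = d."""
    theta = math.atan2(h, d)            # dart is aimed straight at the monkey
    t_hit = d / (v0 * math.cos(theta))  # time for the dart to cover the horizontal distance d
    y_dart = v0 * math.sin(theta) * t_hit - 0.5 * g * t_hit**2
    y_monkey = h - 0.5 * g * t_hit**2   # monkey lets go from rest at t = 0
    return t_hit, y_dart, y_monkey


d, h = 30.0, 10.0
v_min = math.sqrt((d**2 + h**2) * 9.8 / (2.0 * h))  # threshold speed from the exercise
t_hit, y_dart, y_monkey = dart_and_monkey(d, h, 1.5 * v_min)
print(t_hit, y_dart, y_monkey)
```

Running the same function at exactly the threshold speed gives a meeting height of zero, which is what the condition on the launch speed is guarding against.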
Exercise 2.25 In the previous problem write a formula for the vector separation
$r(t) = r_m - r_d$ between the monkey and the dart assuming the dart was fired
at t = 0. Show that r(t) does indeed eventually vanish provided, as before, $v_0 > \sqrt{(d^2 + h^2)g/2h}$.


Exercise 2.26 My enemy, who is at (x = y = 0), launches a projectile at speed vo 
and angle @ above the positive x-axis. Simultaneously I fire my intercept projectile 
from (x = R,y = 0) at speed vj at an angle 0’ above (clockwise from) the negative 
x-axis. I want the intercept to take place at x = 2. (i) What are vj and tan’ in 
terms of vp and tan? (ii) What is the time ¢* and height y* of the intercept? 


Exercise 2.27 Consider two particles with trajectories $r_1(t) = v_1 t - \frac{1}{2}gt^2 j$ and
$r_2(t) = Ri + v_2 t - \frac{1}{2}gt^2 j$ as seen by someone on the ground. Compute their velocities
$v_{1G}$ and $v_{2G}$ relative to the ground. Now consider an observer O falling freely
under gravity, starting at x = 0, y = 0 at t = 0 with velocity $v_{OG} = -gt j$ relative
to the ground. (i) Compute $v_{1O}$ and $v_{2O}$, the velocities of the two bodies relative
to O. Hint: $v_{OG} = -v_{GO}$. (ii) Integrate these velocities with the given initial conditions
and show that the particles move along straight lines in the xy plane as seen
by O. It is the intersection of these lines that enters the solution of the previous 
problem. 


Exercise 2.28 Two robbers, one on the ground and the other on a roof, point 
their guns at each other and fire simultaneously. Given no further information, 
show that the bullets will collide unless the calculated collision point turns out 
to be underground (y < 0), which is not allowed. For the case where the muzzle 
speeds are vı (robber on the ground) and v (other guy) show that the condition 


for colliding above ground is vı + v > T Fca where d is the initial horizontal 


separation and @ is the angle of elevation of the line joining the robbers. 


PROBLEM SET 3, FOR CHAPTERS 3 AND 4 


Exercise 3.1 Lying on a frictionless table is a mass $m_1$ connected by a massless
string to a mass $m_2$ to its right, which in turn is pulled to the right by a force of
$F_1$ as shown in Figure 3.4 in Chapter 3. (i) Find the acceleration a’ if the string
has a mass of $m_s$. (ii) Draw the free-body diagram of the three objects and compute
the forces on each of them. (iii) Repeat if there is a non-zero coefficient of
friction $\mu$ between the masses and the table, denoting by a” the common acceleration.
Optional: Crunch the numbers for $F_1$ = 10N, $m_1$ = 3kg, $m_2$ = 2kg,
$m_s$ = 100g, $\mu$ = 0.2.


Exercise 3.2 Suppose a rocket has an engine that can provide an acceleration of 
ay > g when the rocket is fired horizontally. To what height h will it go when fired 
vertically with its engine on for f seconds and how long (T) will it stay afloat? 
Exercise 3.3 Imagine a circle of radius R in the vertical plane. A tiny bead starts
at the top point A, and slides down along a wire that follows the diameter to the
antipodal point B. Another bead slides down along a chord AC that leaves A at an
angle $\theta$ from the vertical and reaches the circle at a point C. Show that both beads,
falling under gravity, take the same time to finish their journeys. Hint: Introduce 
lines joining the center of the circle to A and to C as an aid to computing the length 
of the chord AC. 


Exercise 3.4 A mass M resting on a table with kinetic friction coefficient $\mu_k$ is
attached to a massless string whose other end goes over a pulley and is attached 
to a mass m that can move up and down. (i) What is the acceleration of M along 
the table (till it hits the pulley) assuming m is large enough to overcome static 
friction? What is this minimum m? (ii) With what speed and at what time does M 
strike the pulley if it is a distance d away? 


Exercise 3.5 A 2kg mass on a frictionless table is connected by massless strings 
to two other masses: a 1kg mass hanging over the left edge of the table and a 3kg 
mass over the right (with a pulley on either edge as required). Compute (i) the 
acceleration of the masses and (ii) the tension on the two strings. 


Exercise 3.6 The left end of a spring of force constant k is anchored to a wall 
and the right end is touching (but not connected to) a mass m at x = 0 resting on 
a frictionless table. The mass is pushed to the left by an amount A, compressing 
the spring by A and setting x(0) = —A. The restraining force is suddenly removed 
at t = 0. (i) Describe qualitatively the motion of the mass for t > 0. (Remember
the spring can push the mass to the right but not pull it back.) (ii) Write an
equation for x(t), the position from t = 0 to $t = t_0$, when the mass loses contact
with the spring. What is $t_0$? Hint: Relate this to the oscillator problem and Eqn. 4.8.
(iii) Write an equation for x(t) for $t > t_0$. (iv) If there is a non-zero coefficient of
kinetic friction $\mu_k$ for x > 0, where will the mass come to rest and when?


Exercise 3.7 A spring of force constant k is in a relaxed state between two equal 
masses m which lie on a frictionless table. Next, each mass is moved a distance 
A toward the center of the spring. Thus initially the spring is compressed by 2A, 
and the masses have equal and opposite coordinates as measured from their equi- 
librium positions. The force holding them in place is suddenly removed and the 
spring is allowed to expand. (i) Will the spring move as a whole? Argue that the 
masses will always have equal and opposite velocities and coordinates. (ii) Why 
will each mass think the spring has a force constant 2k? (iii) With what speed will 
the masses emerge? (iv) When will they lose contact with the spring? Hint: For 
(ii) and (iii) relate the movement of the masses to the suitably adapted oscillator 
solution Eqn. 4.8. 
Exercise 3.8 An object of mass m is hanging from two taut strings as shown in
the leftmost part of Figure E-1. Draw a free-body diagram and derive an expression
for the tensions $T_1$ and $T_2$ on the two strings. Give the numerical values when
m= 10kg,a=%,and b= 5. 


Figure E-1 Atwood machine and free-body diagram


Exercise 3.9 Consider the Atwood machine shown in the middle third of
Figure E-1. The ropes connected to the masses and the pulley are massless and
there is no slip. Recall the argument on page 63 of the book showing that the tensions
on the ropes supporting $m_1$ and $m_2$ are equal: $T_1 = T_2 = T$. (i) Draw the
free-body diagrams for the masses and compute their accelerations $a_1$ and $a_2$, defined
to be positive upward. Argue that $a_1 = -a_2$ by following the string. (ii) What
is T?


Exercise 3.10 In the middle part of Figure E-1 let $m_1$ = 200kg stand for Scaramanga
and a cabin, while $m_2$ = 100kg is an empty cabin. Now add to the picture
Bond, mass 100kg, hanging on to the cable 100m above the empty cabin. At this 
point nothing is moving. If he lets go, when will he land on the cabin below him? 


Exercise 3.11 Consider the setup in the right third of Figure E-1. Find an expression
for the acceleration of $m_1$ and the tension on the string it hangs from. Hint:
Relate the acceleration of $m_2$ to that of $m_1$ by asking by how much $m_2$ moves down
if $m_1$ moves up by some amount. Remember also that the tension is the same at
any point on the string. 


Exercise 3.12 In Exercise 3.9 consider the upward force $T_0$ acting on the pulley
from which the two masses are hanging. You might think $T_0 = (m_1 + m_2)g$. That
would be correct if the masses were not accelerating, which in turn means $m_1 = m_2$.
Find the correct answer using the fact that $T_0 = T_1 + T_2 = 2T$ supports the
entire Atwood machine against gravity. (i) Setting $T_0 = m_e g$, what is the effective
mass $m_e$ supported by $T_0$? (ii) Show that it is less than the sum of the masses. Why
is this? Consider the extreme case when one mass vanishes.


Exercise 3.13 The Double Atwood: Suppose the mass $m_2$ on the right in
Figure E-1 is replaced by another Atwood machine supporting $m_2$ and $m_3$ as
shown in Figure E-2. (i) Using the notion of effective mass from the previous
problem, compute $a_1$, the acceleration of $m_1$. (ii) Check the special case $m_2 = m_3$.


Figure E-2 Double Atwood machine and free-body diagram 


Exercise 3.14 The Double Atwood-Redux: Solve the problem by going back to 
basics without using the notion of effective mass as in Exercise 3.13. Look at 
Figure E-2. On the pulley P you have the string pulling down from both sides 
with force T (which is the tension on the string). The same force T acts upward 
on $m_1$ and the pulley P’. Since P’ is massless, T = 2T’. If you ride with P’ you will
find that relative to it $m_2$ and $m_3$ have accelerations $a_{2P'} = a'$ and $a_{3P'} = -a'$. Use
relations like $a_{2G} = a_{2P'} + a_{P'G}$ to write all accelerations relative to the ground, and
relations like $a_{1G} = -a_{P'G}$ to arrive at the final result.


Exercise 3.15 A block of mass m sits atop a mass M which rests on a frictionless 
table. The mass M is connected to a spring of force constant k attached to the wall. 
(i) How far can mass M be pulled so that upon release, the upper mass m does not 
slip, given that the coefficient of friction between the two masses is $\mu$? (ii) Repeat
if $\mu'$ is the coefficient of friction between M and the table.
Exercise 3.16 A car moving at 70%% rams into an immobile steel wall. The front
of the car is compressed by 0.94m. What average force must a seat belt exert in 
order to restrain a 75kg passenger? 


Exercise 3.17 A 7.2kg mass is hanging from the ceiling of an elevator by a 
spring with force constant k = 150 N/m and an unstretched length of 80cm. Show
that the overall length of the spring when the elevator has an acceleration a is
$X = 80\text{cm} + \frac{m(a+g)}{k}$. Find X when the elevator (i) starts moving upward with acceleration
0.95 m/s$^2$, (ii) moves upward at a steady 14 m/s, and (iii) begins to decelerate
from this velocity to rest, taking 9.0s to do so. (iv) If the elevator measures 3.2m
from floor to ceiling, what is the maximum acceleration it could undergo without 
the 7.2kg mass hitting the floor? 


Exercise 3.18 A mass is hanging by a rope from the ceiling of a bus at rest. What 
angle does the rope make with the vertical when the bus accelerates by a positive 
amount a? 


Exercise 3.19 Two springs have the same unstretched length but spring con- 
stants kı and kz. Find the effective force constant k, if they are connected (i) in 
series and (ii) in parallel. (To find ke, imagine stretching the combination by an 
amount x and keeping track of the force needed.) 


Exercise 3.20 You stand on a merry-go-round spinning at f revolutions per second,
R meters from the center. What is the minimum coefficient of static friction
$\mu_s$ between your shoes and the floor that will keep you from slipping?


Exercise 3.21 An amusement ride consists of a pod suspended by a cable. 
(i) When the pod spins at a frequency f around a horizontal circle of radius R, 
what is the angle 8 the cable makes with the vertical? (ii) If the angle 9 = 7 and 


w= 174, what is L, the length of the cable? 


Exercise 3.22 (i) A massless wire of length a has one end fixed at the origin and 
a mass m at the other end. To this mass is welded another massless wire of length b
at the other end of which is attached another mass m. The whole system is rotating
about the center at angular velocity $\omega$. What is the ratio of the tensions in the inner
and outer wires? (ii) A mass m tethered to a massless rope is spinning in a vertical 
circle, keeping its total energy constant. Find T; — Ty, the difference in the tension 
between the topmost and bottommost points. 


Exercise 3.23 A person of mass M leaves the North Pole for the equator and is 
delighted to find he weighs less. Explain and compute (using a free-body diagram) 
the reduction in weight as a fraction of the weight at the North Pole. Assume the 
earth is a perfect sphere of radius $R = 6.4 \times 10^6$m.
Exercise 3.24 A particle of mass m moves at a steady speed $v_0$ on a frictionless
circular hoop of radius R in the vertical plane, with gravity acting downward.
(i) What radial and tangential forces, $F_r$ and $F_\theta$, does the hoop exert on the particle
when it is at angle $\theta$ above the horizontal? (ii) What is the condition on $v_0$ to
ensure the hoop never exerts a force more than 5mg? (iii) Where on the hoop is 
the point of maximum radial force? 


Exercise 3.25 A mass m moving along a circle of radius R on a frictionless table 
is connected to a massless string which passes through the hole at the center and 
terminates on a second mass M which is free to move vertically. If M is stationary, 
find (i) the tension in the string and (ii) the period of circular motion. 


Exercise 3.26 In an “amusement” ride the victims line up against the walls of a 
drum of radius R which begins to spin faster and faster. At some point the floor 
drops out and the petrified thrill seekers hang there supported by the coefficient 
of friction $\mu$ with the walls. What is the minimum frequency of rotation before
the floor can be safely dropped? 


Exercise 3.27 A bead sliding around a horizontal loop of radius R with friction 
coefficient $\mu$ has a speed $v_0$ at t = 0. (i) Find v(t). (ii) Find the angle by which the
bead has rotated by time t. 


Exercise 3.28 A road is banked so that a car moving at speed $v_0$ can take the
curve of radius R with no friction between the tire and road. If the coefficient of
static friction is $\mu$, show that the range of permissible speeds (with no radial slipping)
is given by $\frac{\tan\theta - \mu}{1 + \mu\tan\theta} \le \frac{v^2}{Rg} \le \frac{\tan\theta + \mu}{1 - \mu\tan\theta}$, where $\tan\theta = \frac{v_0^2}{Rg}$ describes the banking
in the absence of $\mu$. Hint: Resolve F = ma along i and j and remember that
the maximum frictional force (up or down the incline) is $\mu N$. Argue that the
road with banking and friction is equivalent to a frictionless road with banking
$\theta_e = \theta \pm \alpha$ where $\tan\alpha = \mu$ and $\pm$ correspond to going over or under the old
speed limit $v_0$. Use the formula for tan(A + B).


Exercise 3.29 A 1,000kg car is on a country road which has its ups and downs 
or maxima and minima in height. When it is at a maximum with a radius of 
curvature of R = 20m, what is the maximum speed at which the car can maintain 
road contact? If it is at the bottom of a minimum of the same radius, moving at 
the same speed, what force F, does the ground exert on the car? 


Exercise 3.30 The handle of a 22kg lawn mower makes an angle $\alpha$ = 35° with
the horizontal. If the coefficient of friction between lawn mower and ground is
$\mu$ = 0.68, what magnitude of force Fe is required to push the mower at constant
velocity? Assume the force is applied in the direction of the handle. Compare with
the mower’s weight.


Exercise 3.31 A roller coaster of mass M is at the top of the loop-the-loop 
of radius R at twice the minimum speed possible. What force does the track 
exert on it? 


Exercise 3.32 Consider Figure 4.4 in Chapter 4 with $\mu_s$ the coefficient of static
friction between M and the plane. Let masses initially be at rest. Imagine mass m 
is varied continuously to cause motion. Find (i) the range of values of m for which 
M will begin to slide downhill as m is reduced and (ii) the range when M will 
begin to go uphill as m is increased. Between these values motion up or down is 
possible if we give the masses a little push because $\mu_k < \mu_s$. What is the range of
m when such motion is possible? 


PROBLEM SET 4, FOR CHAPTERS 5, 6, AND 7 


Exercise 4.1 If F(x) = Ax*/?, find the work done by it in moving a body from 
x = 0 to x = X.


Exercise 4.2 Find the line integral of F = i2xy* + jx? from (0,0) to (1,1) along 
(i) x = y, (ii) (0,0) > (1,0) > (1, 1), and (iii) (0,0) > (0,1) > (1,1). 


Exercise 4.3 Find the line integral of F = iy? +jxy + kxz from (0,0,0) to (1, 1,1) 
along (i) x= y= z and (ii) x=t,y=f,z=P. 


Exercise 4.4 Are the following fields conservative? If yes, what are the potentials 
U(x, y)? (i) F = iysinx + jcosy, (ii) F = i2xy? + j3y’x’, (iii) F = isinx + jcosy, 
and (iv) F=icoshxcoshy+jsinh xsinhy. 


Exercise 4.5 You have to choose between a 225-W fridge that costs $1150 and a 
425-W model that costs $850. The first runs 11% of the time and the second runs 
20% of the time. If electricity costs 9.5 cents/kWh, when will you break even? (Use 
hours as the unit of time at first; go to days to report the final answer.) 


Exercise 4.6 In the Atwood machine (Figure E-1), let $m_1$ = 14kg be held at a
height 5m above the ground and let $m_2$ = 8kg be at rest on the ground. The masses
are released from rest. (i) Find the velocity of the 14kg mass just before it hits 
the ground in two ways: using F = ma and energy conservation. (ii) Find the 
maximum height reached by the 8kg (and don’t worry about its hitting the pulley). 


Exercise 4.7 A loop-the-loop car of mass m comes down an inclined track from 
a height h and enters the bottom of the loop. (i) What is the minimum h (in terms 
of the loop radius R) that will ensure a safe trip? (ii) What will be the speed of
the car at the lowest part of the circle? (iii) What will be the force exerted by the
track on the car at this point? (iv) If after the car finishes the loop it begins moving
along a horizontal track with coefficient of friction $\mu$, how far will it travel before
stopping? (v) If instead of using gravity a spring of force constant k were to give it 
the initial speed, by what amount A should it be compressed? Get the number for 
part (v) if m= 840kg; R= 6.2m; k = 31 x 109%. 


Exercise 4.8 A block of mass m comes down an inclined plane at angle $\theta$ from a
height h after being given a slight tap to get it going. If the coefficient of friction is
$\mu_k$, what is the velocity when it reaches the ground? Find this using Newton’s law
as well as energy principles.


Exercise 4.9 A mass m falls from a height h onto a spring of force constant 
k. Show that the maximum spring compression is $(mg/k)\left(1 + \sqrt{1 + 2kh/mg}\right)$. 
Show that the maximum speed attained is $v_{\max} = \sqrt{2gh\left(1 + \frac{mg}{2kh}\right)}$. Hint: At what 
x does it stop accelerating downward? 


Exercise 4.10 A child climbs up a ladder of height H to go down a slide which 
ejects her horizontally from a height h. Where does she land? Where would the kid 
land if this had been on the moon, which has a sixth of the gravity of earth? 


Exercise 4.11 A mass is placed on top of a vertical spring and causes it to con- 
tract by d. If it is pushed down a further amount 5d and released, how far above 
the lowest point will the mass rise? 


Exercise 4.12 A skateboarder of mass m begins to slide down from the rim of 
a hemispheric bowl of radius R and drops a height R, At this point, find (i) the 
centripetal force on the skater, (ii) the normal force N exerted by the bowl, and 
(iii) the radial and tangential accelerations, $a_R$ and $a_T$. 


Exercise 4.13 A particle begins to slide off the top of a bowling ball of radius R. 
There is no friction. (i) Find the angle $\theta$, measured from the vertical, when the 
particle loses contact with the surface. (ii) Repeat if the particle started out with a 
velocity $v_0$. 


Exercise 4.14 A particle of mass m is in a potential $V(x) = ax^2 + bx^4$. (i) Find 
its minima if a, b > 0. (ii) Repeat if $a = -|a|$. (Plot the function to see how 
many minima there are.) (iii) Find the frequency $\omega$ of small oscillations about 
the minima. 


Exercise 4.15 A particle of mass m is in a periodic potential $V(x) = -A\cos(2ax)$. 
(i) Where are its minima? (ii) What is the frequency of small 
oscillations about a minimum? 


Exercise 4.16 Find the work done by a force $\mathbf{F} = \mathbf{i}\,x^2y^3 + \mathbf{j}\,x^3y^2$ acting on a body 
from (0, 0) to (1, 1) along three paths: (i) first along x and then along y, (ii) at 45°, 
and (iii) along the curve $y = x^2$. (iv) Show that this is a conservative force and find 
the potential energy U(x, y). Find the work done earlier in terms of difference in 
U. (v) Repeat parts (i) and (ii) for $\mathbf{F} = \mathbf{i}\,xy^2 + \mathbf{j}\,xy$. 


Exercise 4.17 With what speed $v_0$ must a particle be emitted from the bottom of 
a vertical circular track of radius R so that it can go around the top without falling? 


Exercise 4.18 Show that the acceleration due to gravity at a height h above the 
earth can be approximated (for small h) by $g_h = (1 - 2h/R_E)\,g_0$, where $g_0$ is just 
g and $R_E$ is the radius of the earth. What is $g_h$ on top of Mount Everest? Hint: 
$(1 + x)^n \approx 1 + nx$ for $x \ll 1$. 


Exercise 4.19 Find the period of the circular orbit of our sun around the center 
of our galaxy (taken as a point mass $4 \times 10^{41}$kg) at a radius of $3 \times 10^{4}$ light years. 
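
A rough numerical sketch of Exercise 4.19, assuming the data read as a point mass of 4 x 10^41 kg and a radius of 3 x 10^4 light years. The rounded constants and the variable names are my own choices, not the book's.

```python
import math

G = 6.674e-11             # gravitational constant, m^3 kg^-1 s^-2
LIGHT_YEAR_M = 9.461e15   # metres in one light year (rounded)
YEAR_S = 3.156e7          # seconds in one year (rounded)

M_GALAXY = 4e41                 # kg, as read from the exercise
r = 3e4 * LIGHT_YEAR_M          # orbit radius in metres

# Circular orbit: G M m / r^2 = m (2 pi / T)^2 r  =>  T = 2 pi sqrt(r^3 / (G M))
period_s = 2 * math.pi * math.sqrt(r**3 / (G * M_GALAXY))
period_yr = period_s / YEAR_S
print(f"T = {period_s:.2e} s = {period_yr:.2e} years")
```

With these inputs the period is a bit under two hundred million years.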


Exercise 4.20 A meteorite 80,000km from the earth is moving toward the earth 
at 2,000 m/s. Ignoring air friction, what will be its velocity on impact? 


Exercise 4.21 Calculate the radius of the geosynchronous orbit about the earth. 
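
As a numerical check for Exercise 4.21, here is a minimal sketch. It assumes rounded values for G and the earth's mass and takes the period to be one sidereal day; use 86,400s instead if you prefer the solar day. The names are illustrative.

```python
import math

G = 6.674e-11          # m^3 kg^-1 s^-2
M_EARTH = 5.972e24     # kg (rounded)
R_EARTH = 6.371e6      # m, mean radius (rounded)
T_SIDEREAL = 86164.0   # s, one sidereal day (assumption)

# G M m / r^2 = m (2 pi / T)^2 r  =>  r^3 = G M T^2 / (4 pi^2)
r_geo = (G * M_EARTH * T_SIDEREAL**2 / (4 * math.pi**2)) ** (1.0 / 3.0)
altitude_km = (r_geo - R_EARTH) / 1000.0
print(f"r = {r_geo:.3e} m, altitude = {altitude_km:.0f} km")
```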


Exercise 4.22 A rocket is launched at escape velocity from the surface of the 
earth (radius $R_E$). What is its velocity when it is at a distance r from the center 
of the earth in terms of G and $M_E$? 


Exercise 4.23 Two planets of mass m and M start at rest an infinite distance 
from each other and begin to fall toward each other. Argue that since they exert 
equal and opposite forces on each other, it follows that $m\mathbf{v}_1 + M\mathbf{v}_2 = 0$, where 
$\mathbf{v}_1$ and $\mathbf{v}_2$ are their velocities. Use this result to show that when they are a 
distance d apart, their relative velocity is $v = \sqrt{2G(m + M)/d}$. 


Exercise 4.24 A satellite is in an elliptical orbit around the earth with altitude 
(measured from the surface of the earth) ranging from 230 to 890km. At the high 
point it is moving at 7.23 km/s. What is its speed at the low point? 


PROBLEM SET 5, FOR CHAPTER 8 


Exercise 5.1 Consider a square of mass 4kg, side 2m, and negligible thickness, 
with its sides oriented along the usual axes and its center at (0,0). (i) Determine 
its CM using symmetry arguments. (ii) Imagine that the 1m x 1m part of it in the 
fourth quadrant is chopped off. Where is the new CM of this object? Do it first by 
considering the three 1m x 1m squares that constitute the object as point masses 
located at their CMs and then find the CM of the three point masses. Next, re-derive 
this, treating the chopped-off shape as the full square plus a 1m x 1m square 
of negative mass —1kg in the fourth quadrant. (iii) A disk of radius R centered at 
the origin has a circular hole of radius R centered at (x = R, y = 0). Where is 
its CM? 
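
The two routes suggested in part (ii) of Exercise 5.1 can be compared numerically. The sketch below treats each piece as a point mass at its own CM; the helper name `center_of_mass` is mine.

```python
def center_of_mass(points):
    """points: iterable of (mass, x, y); a negative mass represents a hole."""
    total = sum(m for m, _, _ in points)
    x = sum(m * px for m, px, _ in points) / total
    y = sum(m * py for m, _, py in points) / total
    return x, y


# Route 1: the three surviving 1m x 1m squares, 1kg each.
three_squares = [(1.0, 0.5, 0.5), (1.0, -0.5, 0.5), (1.0, -0.5, -0.5)]

# Route 2: the full 4kg square plus a -1kg square in the fourth quadrant.
full_plus_hole = [(4.0, 0.0, 0.0), (-1.0, 0.5, -0.5)]

cm_pieces = center_of_mass(three_squares)
cm_negative = center_of_mass(full_plus_hole)
print(cm_pieces, cm_negative)
```

Both routes give the same point, displaced away from the missing corner, which is the whole point of the negative-mass trick.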


Exercise 5.2 Find the CM of a cone of radius R and height h. (Think in terms of 
slices of thickness dy at height y.) 


Exercise 5.3 From a rectangle of width 20cm and height 10cm a circle of radius 
5cm centered on the left half is cut out. (i) Where is the CM? (ii) What if the circle 
had its center at the geometric center of the rectangle? 


Exercise 5.4 Two identical stars of mass M orbit around their CM. (i) Show that 
$T^2 = \frac{2\pi^2 R^3}{GM}$, where R is the distance between the stars. (Draw a figure and keep 
track of the gravitational force on either star as well as its centripetal acceleration.) 
(ii) Repeat for two unequal masses m and M, and show $T^2 = \frac{4\pi^2 R^3}{G(M+m)}$, where R is 
the separation between stars. 


Exercise 5.5 Consider a massless boat of length L on frictionless water. At the 
left end is a person $P_1$ of mass $m_1$ holding a backpack of mass $m_3$. At the right end 
is a person $P_2$ of mass $m_2$. At t = 0, $P_1$ throws the pack toward $P_2$ at a speed v. 
(i) What is V, the magnitude of the velocity of the boat and passengers relative 
to water when the pack is airborne? (ii) How long does the pack take to reach $P_2$? 
(iii) During this time how much has the boat moved to the left? (iv) Locate the 
final CM (relative to the original location of the left end of the boat) and show it 
is the same as at the beginning. 


Exercise 5.6 A disgusted reader of mass M = 32.5kg on ice flings my textbook 
weighing m = 2.25kg at $v_0$ = 12 m/s. The book is thrown from zero height, and the 
total distance between the book and the reader is 15.2m when the book lands. 
(i) At what angle was this excellent book thrown? (ii) How fast is the reader 
moving at the end? 


Exercise 5.7 Block A of mass m is moving at velocity +v toward block B of mass 
2m, which is at rest. To block B’s right and at rest is block C of mass m. Find the 
ultimate velocities of all three masses assuming all collisions are elastic. Verify that 
your final answers satisfy conservation of momentum and energy. 


Exercise 5.8 Two identical frictionless billiard balls are symmetrically hit by a 
third identical ball with velocity $\mathbf{i}v_0$ as in Figure E-3(a). Find all subsequent 
velocities following this elastic collision. (Draw a picture at the moment of collision. 
What does “no friction” say about the direction of the forces?) 


Figure E-3 (a) Exercise 5.8. (b) Exercise 5.9; the vine hangs from a tree branch. 


Exercise 5.9 Ideal Zorro (mass M, no height), standing at the edge of a cliff 
of height H, swings down on a vine of length L, grabs a kid of mass m (zero 
height, standing on the ground), and barely reaches safety at a height h, as depicted 
in Figure E-3(b). Relate H to the other parameters. What is H in meters if 
L = 40m, M = 100kg, h = 6m, m = 30kg? 


Exercise 5.10 Show that if momentum is conserved in one frame of reference, 
that is, the sum of the initial momentum vectors equals the sum of the final ones, 
it is conserved in another frame moving at velocity u relative to the first. 


Exercise 5.11 A particle of mass m moving at $v_0$ collides with another of mass 
M and comes to rest. (i) What is the velocity of M? (ii) What is the loss in kinetic 
energy? 


Exercise 5.12 An astronaut of mass 100kg wants to return to his spaceship, 
which is 200m away, by throwing his wrench in the opposite direction. (i) If he 
got to the ship in 10s, what was his velocity relative to the ship? (ii) If the wrench 
had a mass of 20kg, what was its velocity relative to the ship? 


Exercise 5.13 A bullet of mass 4g traveling at 300 m/s passes through a block of 
mass 1200g initially at rest on a frictionless table and emerges imparting to the 
block a velocity v = 0.5 m/s. The bullet then moves on to get embedded in a second 
block of mass 1500g. (i) What is the final velocity of the second block with the 
embedded bullet? (ii) What is the loss in kinetic energy? 
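
Here is a small sketch of the momentum bookkeeping in Exercise 5.13. It assumes the speeds are in m/s (bullet at 300 m/s, first block leaving at 0.5 m/s) and one-dimensional motion; the names are illustrative.

```python
m_bullet, v_bullet = 0.004, 300.0    # kg, m/s (units assumed)
m_block1, v_block1 = 1.2, 0.5        # kg, m/s
m_block2 = 1.5                       # kg

# Stage 1: bullet passes through block 1; momentum is conserved.
v_bullet_after = (m_bullet * v_bullet - m_block1 * v_block1) / m_bullet

# Stage 2: bullet embeds in block 2 (totally inelastic).
v_block2 = m_bullet * v_bullet_after / (m_block2 + m_bullet)

ke_initial = 0.5 * m_bullet * v_bullet**2
ke_final = (0.5 * m_block1 * v_block1**2
            + 0.5 * (m_block2 + m_bullet) * v_block2**2)
ke_loss = ke_initial - ke_final
print(v_bullet_after, v_block2, ke_loss)
```

Nearly all of the bullet's kinetic energy is lost, as one expects when a light object ploughs into heavy ones.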


Exercise 5.14 A proton collides with a static alpha particle (which is four times 
as heavy) and emerges at an angle +65° with respect to the initial direction, while 
the $\alpha$ emerges with a speed 4 x 10° = at an angle of -45°. (i) What is the proton’s 
final velocity? (ii) What was its initial energy? 


Exercise 5.15 A 2kg projectile with a velocity $20\,\mathbf{i}$ m/s strikes a 4kg target at rest. 
The projectile emerges at a velocity $\mathbf{v} = 10(\mathbf{i}\cos 30^\circ - \mathbf{j}\sin 30^\circ)$ m/s. (i) Find $\mathbf{v}'$, the 
velocity of the target after the collision. (ii) Is this collision elastic? 
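
Exercise 5.15 is a two-component momentum balance followed by an energy comparison. The sketch below assumes the velocities are in m/s; the function and variable names are my own.

```python
import math

m_proj, m_target = 2.0, 4.0                    # kg
u = (20.0, 0.0)                                # projectile before, m/s (assumed units)
angle = math.radians(30.0)
v = (10.0 * math.cos(angle), -10.0 * math.sin(angle))   # projectile after

# Momentum conservation, component by component: m u = m v + M v'
v_target = tuple((m_proj * ui - m_proj * vi) / m_target for ui, vi in zip(u, v))


def kinetic(mass, vel):
    """Kinetic energy of a mass moving with 2D velocity vel."""
    return 0.5 * mass * (vel[0] ** 2 + vel[1] ** 2)


ke_before = kinetic(m_proj, u)
ke_after = kinetic(m_proj, v) + kinetic(m_target, v_target)
is_elastic = math.isclose(ke_before, ke_after, rel_tol=1e-9)
print(v_target, ke_before, ke_after, is_elastic)
```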


Exercise 5.16 Show that when mass $m_1$ moving with velocity $v_1$ has a totally 
inelastic collision with mass $m_2$ moving at $v_2$, the loss of energy is 
$\Delta K = \frac{m_1 m_2 (v_1 - v_2)^2}{2(m_1 + m_2)}$. A truck of mass 2,000kg is approaching the origin from the left at 
a speed 100 m/s, while a car of mass 1,000kg is approaching the origin from the 
right at a speed 50 m/s. They are 1km apart. (i) Where is the CM relative to the car? 
(ii) What is the velocity of the CM? (iii) If they collide and stick together, what will 
be the velocity of the combination? (iv) What is the loss in kinetic energy? (Use 
the formula you derived above.) 


Exercise 5.17 Consider two masses $m_1$ and $m_2$ moving at $v_1$ and $v_2$ along the 
x-axis. (i) What is V, the velocity of the CM? Go to the frame of the CM (i.e., 
to a frame moving at V). Find (ii) the velocity of each particle and (iii) the total 
momentum before and after the collision. Why is the answer expected? 


Exercise 5.18 Imagine two sleds A and B each of mass 200kg at rest on ice, with B 
to the right of A. A kid of mass 40kg jumps from A to B and back with an airspeed 
of 27 relative to the ground. What are the final velocities $v_A$ and $v_B$? Hint: Use 
momentum conservation at each stage. For $v_B$ you can shorten the calculation by 
equating the change in the kid’s momentum during his interaction with sled B to 
(minus) that of the sled. 


Exercise 5.19 A neutron of mass m collides elastically with the following particles 
at rest: (i) a proton (mass m), (ii) a deuteron (mass 2m), and (iii) an $\alpha$ (mass 
4m). What is the ratio of its final kinetic energy to its initial kinetic energy? 


Exercise 5.20 A 3kg mass is acted upon by a force F = 3i + 6j, and a 4kg 
mass is acted upon by a force F = 9i + 4j. What is the acceleration of 
the CM? 


Exercise 5.21 A firework launched at 20 m/s explodes into two identical fragments 
at the top of its flight. If the second one reached the ground 3s after the explosion, 
when did the first? 


Exercise 5.22 A spaceship of mass 800kg (when empty) moving at vj = 20 


needs to reach 200 by firing its rocket, which emits gas at vp = 508% relative to 
itself. How much fuel does this take? 


Exercise 5.23 Find the final speed of a rocket of mass 1,000kg that starts at rest 
carrying 2,000kg of fuel and emits gas at $v_0$ = 2,000 in its frame. 


Exercise 5.24 What multiple x of its empty mass m must a rocket carry if its 
terminal velocity is to equal $v_0$, the velocity of the exhaust gases in its frame? 


Exercise 5.25 A rocket that can carry twice its mass in fuel reaches a velocity vp 
when half the fuel is used up. What will be its terminal velocity vr? 
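
Exercises 5.23 to 5.25 all turn on the rocket equation, which says the gain in speed is the exhaust speed times the logarithm of the mass ratio. A minimal sketch, with a helper name (`delta_v`) of my own and results left in whatever units the exhaust speed is quoted in:

```python
import math


def delta_v(exhaust_speed, initial_mass, final_mass):
    """Rocket equation: speed gained while the mass drops from initial to final."""
    return exhaust_speed * math.log(initial_mass / final_mass)


# Exercise 5.23: 1,000kg rocket, 2,000kg fuel, exhaust speed 2,000.
v_final_523 = delta_v(2000.0, 1000.0 + 2000.0, 1000.0)

# Exercise 5.24: terminal speed equals exhaust speed when ln(1 + x) = 1.
x_524 = math.e - 1.0

# Exercise 5.25: empty mass 1, fuel 2 (arbitrary mass units).
# Ratio of terminal speed to the speed reached when half the fuel is gone.
ratio_525 = delta_v(1.0, 3.0, 1.0) / delta_v(1.0, 3.0, 2.0)
print(v_final_523, x_524, ratio_525)
```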


PROBLEM SET 6, FOR CHAPTERS 9, 10, AND 11 


Exercise 6.1 A car goes around a track of circumference 4km in 1 minute. What 
is its (i) $\omega$, (ii) tangential velocity $v_T$, and (iii) centripetal acceleration $a_c$? 


Exercise 6.2 Calculate the moment of inertia of a cylinder of mass M and radius 
R through its axis of symmetry. 


Exercise 6.3 Three identical masses m are located at the three corners of an equi- 
lateral triangle of sides a in the xy-plane, with its base on the x-axis. Find their CM 
and moment of inertia I with respect to (i) an axis through the CM parallel to the 
x-axis, (ii) an axis through the CM parallel to the y-axis, and (iii) an axis through 
the CM parallel to the z-axis. 


Exercise 6.4 What is the moment of inertia of an L-shaped object in the xy-plane 
with sides of equal mass M, length a, and negligible transverse dimensions with 
respect to (i) an axis parallel to the z-axis and through the corner and (ii) an axis 
parallel to the z-axis and through the CM (which lies outside the object)? 


Exercise 6.5 (i) What is the moment of inertia $I_{CM}$ of a propeller with three 
blades (treated as rods) of mass m, length L, at 120° relative to each other? (ii) If a 
torque $\tau$ acts on this, how long will it take to reach an angular velocity $\omega$? (iii) How 
many revolutions will it have made before reaching this $\omega$? (iv) Get the numerical 
answers if L = 1.25m, m = 12kg, $\tau$ = 3,000N·m, $\omega$ = 2,000 rad/s. 


Exercise 6.6 Consider a rectangle of sides a (along the y-axis) and b (along the 
x-axis). Show that the moments of inertia about the symmetry axes along the x 
and y directions are $\frac{1}{12}Ma^2$ and $\frac{1}{12}Mb^2$, respectively. (Rotate the rectangle about 
one of these axes and think of it as composed of rods.) Going back to the very 
definition of I, show that if this rectangle is spun around an axis through its CM 
and perpendicular to its area, the moment of inertia will be $I = \frac{1}{12}M(a^2 + b^2)$. 


Exercise 6.7 A planar disk of radius R has an areal mass density p (r) = A- r x, 
where A is a constant and r is the distance from the center. What is its moment of 
inertia for rotations around a perpendicular axis passing (i) through the center of 
mass and (ii) through a point on its circumference? Express $I_{CM}$ and $I_{edge}$ in terms 
of R and M, the mass of the disk. Argue that both should be greater than $\frac{1}{2}MR^2$, 
the answer for uniform density. 


Exercise 6.8 By direct integration, show for an axis passing through the center 
that $I = \frac{2}{5}MR^2$ for a solid sphere and $I = \frac{2}{3}MR^2$ for a hollow sphere. (Consider 
slices perpendicular to the axis.) 


Exercise 6.9 Argue that $\mathbf{A}\cdot(\mathbf{A}\times\mathbf{B}) = 0$. In three dimensions find the expression 
for $\mathbf{A}\times\mathbf{B}$ in terms of vector components and i, j, and k. 


Exercise 6.10 (i) Relate $\mathbf{A}\times\mathbf{B}$ to the area of a parallelogram of sides A and B. 
(ii) In three dimensions relate $(\mathbf{A}\times\mathbf{B})\cdot\mathbf{C}$ to the volume of a certain solid. 


Exercise 6.11 Consider a planet moving around the sun, bound by F,, the radial 
force of gravity. Show that L = r x p is conserved. Next assume the fixed L points 
out of the page. This means r and p must lie in the plane of the page. Argue, by 
considering two successive instants of time, that the orbit will lie in the plane of the 
page. (Consider the plane of the triangle made of r(t), r(t + dt), and dr.) 


Exercise 6.12 A particle at position r = 3i + 4j has momentum p = 4i + 3j. 
(i) What is its angular momentum L? (Give its components, as well as magni- 
tude and direction.) (ii) Deduce the angle between r and p from the magnitudes 
of L, p, and r. 
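
A short sketch of Exercise 6.12 using a hand-written cross product, so the components can be read off directly. The helper names `cross` and `norm` are mine, and units are left implicit as in the exercise.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def norm(a):
    """Euclidean length of a vector."""
    return math.sqrt(sum(c * c for c in a))


r = (3.0, 4.0, 0.0)
p = (4.0, 3.0, 0.0)

L = cross(r, p)                                   # points along -k
sin_angle = norm(L) / (norm(r) * norm(p))         # |L| = r p sin(angle)
angle_deg = math.degrees(math.asin(sin_angle))    # acute here, since r.p > 0
print(L, angle_deg)
```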


Exercise 6.13 A marble of radius r and mass m is at the top of a loop-the- 
loop of radius R >> r with the minimum speed and is rolling without slipping. 
Neglecting $r/R$, what is its kinetic energy of (i) rotations about its CM and 
(ii) translation of the CM? (iii) What if $r/R$ is not neglected? 


Exercise 6.14 A block of mass m = 4.8kg is initially held at rest on an inclined 
plane with a $\theta$ = 30° slope, at a height of h = 1m. It is attached to a cylindrical 
pulley at the top of the slope (mass M = 1.7kg and radius 8cm) by a massless 
string that unwinds as the block is allowed to slide downhill. (i) If the acceleration 
of the block is a = 1.9 m/s², what is $\mu_k$? (ii) Find the velocity v at the bottom of the 
slope using forces and torques. Repeat using energy ideas. 


Exercise 6.15 A sphere of mass m and radius R rolls without slipping down a 
plane at an angle $\theta$ above the horizontal. By applying the dynamical laws for translation 
and rotation and the condition of rolling without slipping, show that the 
acceleration of the CM (measured along the slope) is $a_{CM} = \frac{5}{7}g\sin\theta$ and that the 
force of static friction needed is $f = \frac{2}{7}mg\sin\theta$. What is the minimum coefficient 
of friction $\mu_s$ that is needed? 


Exercise 6.16 A spool of thread of radius R and mass M is supported against 
gravity. The free end of the thread is attached to the ceiling. (i) Compute the 
instantaneous acceleration of the spool when the supporting force is suddenly 
removed. (ii) What is T, the tension on the string? 


Exercise 6.17 A disk of radius R and mass M is spinning at an angular velocity 
$\omega_0$. A non-rotating concentric disk of mass m and radius r drops on it from 
a negligible height, and the two rotate together. Find (i) the final $\omega$ and (ii) the 
fraction of initial kinetic energy left. 


Exercise 6.18 A star of mass 10°°kg and radius 10° m is rotating with a time pe- 
riod T = 10°s. It then undergoes collapse to a neutron star of radius 10km. Find 
the initial and final I (assuming the star is a sphere of uniform density at all stages) 
as well as the final time period. Just keep track of powers of 10. 


Exercise 6.19 A ballerina spinning at 6 rad/s reduces her moment of inertia from 
$6\ \mathrm{kg\cdot m^2}$ to $4\ \mathrm{kg\cdot m^2}$. (i) What is her final angular velocity? (ii) How much work 
does she do? 


Exercise 6.20 A gyro consists of a solid disk of radius R mounted at one end of a 
shaft of zero mass and length l, the other end of which is on a pivot. The disk spins 
at $\omega$ rad/s and the gyro precesses at $\omega_p$ rad/s. What is l in terms of $\omega_p$, $\omega$, g, and R? Give 
a number for l when R = 0.6m, $\omega_p$ = 2.64 rad/s, and the disc is spinning at 450rpm. 


Exercise 6.21 Two beads of mass m are free to slide on a rod of length L and 
mass M as in Figure E-4(a). Initially the beads are at the center and the rod is 
spinning freely (with no external torque) at $\omega_0$ rad/s about a vertical axis through its 
center. Slowly the beads move radially out (at negligible velocity). (i) Find $\omega(r)$, 
the angular velocity, when the beads are a distance r from the center. (ii) What is $\omega$ 
when they just fly tangentially off the rod? Argue that $\omega$ does not change hereafter. 
Hint: Using the formula L = r x p for the angular momentum of beads, show that 
their L does not change even though r does. Also are there any torques on the 
spinning rod after the beads detach? (iii) Why was the force of friction between 
beads and rod unimportant in the preceding discussion? 


Exercise 6.22 A disk of mass M and radius R is rotating at $\omega_0$. A point mass M 
at rest is dropped on to its rim from negligible height and sticks to it. Compute 
the new $\omega$ and kinetic energy K. 


Exercise 6.23 What is the minimum height H from which a marble should be 
released such that it can make it around the loop-the-loop of radius R? Neglect 
the radius r of the marble compared to R. 


Exercise 6.24 A billiard ball with zero angular velocity and linear velocity $v_0$ is 
released on a horizontal surface with coefficient of friction $\mu_k$. It begins to roll 
without slipping after traveling a distance d. Show that $\mu_k = \frac{12 v_0^2}{49 g d}$. Hint: Find the 
force of friction in magnitude and direction and its effect on the CM velocity and 
angular velocity. Remember that $v = \omega R$ does not apply when there is rolling with 
slipping. 
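
The result of Exercise 6.24 can be checked by brute force: step the sliding ball forward in time until $v = \omega R$ and compare with the closed form $\mu_k = 12v_0^2/(49gd)$ that the dynamics give for a solid sphere. This is only a sketch; the test values and the ball radius are hypothetical, and the radius drops out of the answer.

```python
def distance_until_rolling(v0, mu, g=9.8, radius=0.03, dt=1e-5):
    """Euler-integrate a sliding solid sphere (I = 2/5 m R^2) until v = omega R."""
    v, omega, x = v0, 0.0, 0.0
    while v > omega * radius:
        x += v * dt
        v -= mu * g * dt                              # friction slows the CM
        omega += 5.0 * mu * g / (2.0 * radius) * dt   # friction torque spins it up
    return x


v0, mu, g = 2.0, 0.2, 9.8            # hypothetical test values
d = distance_until_rolling(v0, mu, g)
mu_recovered = 12.0 * v0**2 / (49.0 * g * d)
print(d, mu_recovered)
```

Feeding the simulated distance back into the formula returns the friction coefficient we started with, to within the step size.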


Exercise 6.25 This is a variant of the previous problem. A billiard ball with zero 
linear velocity and angular velocity $\omega_0$ is released on a surface with coefficient of 
friction $\mu_k$. (i) What will be its final angular velocity after rolling without slipping 
is achieved? (ii) How long will it take to achieve this? (iii) How far does it travel in 
this time? 


Exercise 6.26 A rod of mass M and length L is hanging vertically from one end. 
A putty of mass m moving horizontally with speed $v_0$ strikes it at a distance x 
from the pivot point and sticks to it. (i) What is the ratio $K_f/K_i$ of the final to 
initial kinetic energies? (Be clear about what conservation laws apply.) (ii) For the 
case X= Z determine the height h to which the CM of the rod+putty system will 
rise. 


Exercise 6.27 A rod of mass M and length L is hanging vertically from one end. 
A putty of mass m and horizontal speed $v_0$ strikes it at its midpoint and sticks to it. 
What is the minimum $v_0$ that will allow the final combination to rotate by 180°? 


Exercise 6.28 A boy of mass m is standing on the rim of a platform of radius 
R and moment of inertia $I_{CM} = I_0$. The boy begins to walk along the rim at a 
speed $v_0$ relative to the platform. (i) What is the angular velocity of the platform? 
(ii) What is the boy’s velocity relative to the ground? (iii) How long will it take for 
him to return to the initial position relative to the ground? Hint: Work 
out all quantities in the ground frame. 


Exercise 6.29 Consider the Atwood machine shown in the middle third of 
Figure E-1. Let the pulley have a non-zero mass M and radius R. What is the 
upward acceleration a of $m_2$, the mass on the right? (Since the pulley has a non- 
zero moment of inertia, the tensions on the two sides are necessarily unequal to 
ensure the pulley’s angular acceleration.) 


Exercise 6.30 In the last problem let $m_1$ = 4kg, $m_2$ = 2kg, and M = 5kg. 
(i) What is the downward acceleration a of the 4kg mass? (ii) What are the tensions 
$T_2$ and $T_1$ on the 2kg and 4kg masses? (iii) What is the speed of the 4kg block 
after it descends 4m? 
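
A numerical sketch for Exercise 6.30. It assumes the pulley is a uniform disk (moment of inertia $MR^2/2$) and that the string does not slip, neither of which is stated in the exercise. With that assumption the radius cancels and the acceleration is $a = (m_1 - m_2)g/(m_1 + m_2 + M/2)$. The variable names are mine.

```python
import math

g = 9.8                       # m/s^2
m1, m2, M = 4.0, 2.0, 5.0     # kg; M is the pulley mass

# Newton for each block plus torque on the pulley, with alpha = a / R:
#   m1 g - T1 = m1 a,   T2 - m2 g = m2 a,   (T1 - T2) R = (M R^2 / 2)(a / R)
a = (m1 - m2) * g / (m1 + m2 + 0.5 * M)
T1 = m1 * (g - a)             # tension on the descending 4kg side
T2 = m2 * (g + a)             # tension on the rising 2kg side
v_after_4m = math.sqrt(2.0 * a * 4.0)
print(a, T1, T2, v_after_4m)
```

The two tensions differ by exactly $Ma/2$, which is what it takes to spin up the pulley.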


Exercise 6.31 A uniform rod of weight 400N is suspended from the ceiling by a 
pivot. The rod is then held at an angle 30° below the horizontal by a horizontal 
force F applied to the free end. Find the value of (i) F and (ii) the force $F_p$ applied 
by the pivot. 


Exercise 6.32 A massless rod that leans to the left at an angle 45° above the horizontal 
is supported on the ground by a pivot and at its midpoint by a horizontal 
cable that is anchored on the wall to its right. The rod supports a load of 200N at 
its free end. (i) What is $F_p$, the force exerted by the pivot? (ii) Repeat if the rod has 
a weight 100N. 


Exercise 6.33 A sphere of radius R is supported by a rope attached to a wall as 
shown in Figure E-4(b). The rope makes an angle $\theta$ with respect to the wall. The 
point where the rope is attached to the ball is such that if the line of the rope is 
extended, it crosses the horizontal line through the center of the ball at a distance 
3R/2 from the wall. Show that the minimum $\mu_s$ between wall and ball for this to 
be possible is $\mu_s = \frac{1}{2}\cot\theta$. Evaluate this for $\theta$ = 30°. Hint: Find the right place to 
take torques. 


Exercise 6.34 A horizontal rod of mass m = 8kg and length L = 2.4m is hinged 
to a wall and supported by a cable that makes an angle of $\theta$ = 25° as shown in 
Figure 11.2 in Chapter 11. (i) If in addition a mass M = 25kg is welded to the far 
end of the rod (where the cable meets the rod), what is the tension T on the cable, 
and what is the force exerted by the pivot? (ii) If the cable snaps, with what $\omega$ will 
the rod and mass slam into the wall? Do it using torques and also using energy 
conservation. 


Figure E-4 (a) Exercise 6.21 (b) Exercise 6.33 (c) Exercise 6.35 


Exercise 6.35 A rod of length L and mass m is supported at one end by a hinge 
on the wall and at the other by a cable as shown in Figure E-4(c). It also carries a 
weight M as shown. Find (i) the tension T in the cable and (ii) the force F exerted 
by the hinge. Give the numbers when mg = 1,000N, Mg = 2,000N, 6, = 60°, and 
b = 30°. 


Exercise 6.36 A seesaw is 4m long and has a 100N kid sitting on the left end 
and a 30N kid on the right. You want to enlist another kid to balance the seesaw. 
(i) What is the lightest kid you can enlist and where will you place her? (ii) If you 
get a kid of double this weight, where will you place her? (iii) If there are no extra 
kids but you can move the support away from the midpoint, where will you place 
it for a balance? 


PROBLEM SET 7, FOR CHAPTERS 12, 13, 14, AND 15 


Exercise 7.1 Explain why c = 1 if you use light-seconds as the unit of distance 
and a second as the unit of time. The same logic holds if you use a light-year as the 
unit of distance and a year as the unit of time. How long is a light-second? 


Exercise 7.2 The earth and sun are 8.3 light-minutes apart. Ignore their relative 
motion for this problem and assume they live in a single inertial frame, the Earth- 
Sun frame. Events A and B occur at $t_A$ = 0 on the earth and at $t_B$ = 2 minutes 
on the sun, respectively. (i) Find the time difference $t'_B - t'_A$ between the events 
according to an observer moving at u = 0.8c from earth to sun. (ii) Repeat if the 
observer is moving in the opposite direction at the same speed. 
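
For Exercise 7.2 the only tool needed is the Lorentz transformation of a time difference. The sketch below works in minutes and light-minutes so that c = 1, and takes the earth-to-sun direction as positive x; that sign convention and the function name are my choices.

```python
import math


def lorentz_dt(dt, dx, u):
    """Time separation seen from a frame moving at velocity u (units with c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - u * u)
    return gamma * (dt - u * dx)


dx = 8.3    # light-minutes, from earth (event A) to sun (event B)
dt = 2.0    # minutes, t_B - t_A in the Earth-Sun frame

dt_toward_sun = lorentz_dt(dt, dx, 0.8)      # observer moving earth -> sun
dt_toward_earth = lorentz_dt(dt, dx, -0.8)   # observer moving sun -> earth
print(dt_toward_sun, dt_toward_earth)
```

A negative result means that observer sees B happen before A, which is allowed here because the two events are spacelike separated.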


Exercise 7.3 In the Earth-Sun case of Exercise 7.2: (i) What is the speed of a 
spacecraft that makes the trip from the sun to the earth in 5 minutes according to 
the onboard clocks? (ii) What is the trip time in the Earth-Sun frame? (iii) Find 
the spacetime interval $(\Delta s)^2$ between them in light-seconds. Think in terms of 
events, assign as many possible spacetime coordinates as you can to each event 
in any frame, and use the Lorentz transformation. Measure time in minutes and 
distance in light-minutes. 


Exercise 7.4 A muon has a lifetime of approximately $2 \times 10^{-6}$s in its rest frame. 
It is created 100km above the earth and moves toward it at a speed $2.97 \times 10^{8}$ m/s. 
(i) At what altitude does it decay? (ii) According to the muon, how much of the 
atmosphere went past it in its brief life? (iii) What is the difference in proper time 
between its birth and death? 
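
A numerical sketch of Exercise 7.4, assuming the data read as a lifetime of 2 x 10^-6 s and a speed of 2.97 x 10^8 m/s, and using the rounded value c = 3.00 x 10^8 m/s so that v/c = 0.99. The names are illustrative.

```python
import math

C = 3.0e8          # m/s, rounded (assumption)
v = 2.97e8         # m/s
tau = 2e-6         # s, lifetime in the muon's rest frame
h0 = 100e3         # m, altitude where the muon is created

beta = v / C
gamma = 1.0 / math.sqrt(1.0 - beta**2)

lifetime_earth = gamma * tau               # dilated lifetime in the earth frame
distance_earth = v * lifetime_earth        # distance covered in the earth frame
decay_altitude = h0 - distance_earth

atmosphere_passed = v * tau                # what rushes past in the muon frame
print(gamma, decay_altitude, atmosphere_passed)
```

The two distances differ by the factor gamma, as length contraction requires, and the proper time between birth and death is just the rest-frame lifetime.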


Exercise 7.5 Show that a muon of lifetime $2.2\,\mu$s moving at v = 0.9999c can make 
it to the earth from an altitude of 30km. 


Exercise 7.6 What should be the minimum velocity of a muon of lifetime 
$2.2 \times 10^{-6}$s if it is to make a journey of 20km? 


Exercise 7.7 A meter stick moves past me at u = .8c. (i) What is its length ac- 
cording to me and (ii) how long does it take to move past me? (iii) In the rod’s 
frame, how long did I take to go from one end to the other? 


Exercise 7.8 A train of proper length 100m zooms past a stationmaster at u = 
0.6c. The passing of the front end and stationmaster is the origin of spacetime 
coordinates in both frames. (i) What is the length of the train according to the 
stationmaster? (ii) At what time on the platform does the rear end of the train 
pass the stationmaster? (iii) What is the time for this event in the train frame? 
(View the stationmaster in the train frame or use the Lorentz transformation of 
coordinates in the station frame.) (iv) If at (0,0) a light pulse was emitted, when 
would it reach the rear of the train in the train frame? (v) In the station frame? 
(vi) What is the spacetime interval between the front passing the stationmaster 
and the rear passing the stationmaster? (Easiest in station frame.) 


Exercise 7.9 A train (primed coordinates) of rest length Lo = 200m moving at 
v = 0.8c is approaching a tunnel (unprimed coordinates) of length L = 120m. 
(i) Argue that the entire train can fit into the tunnel at one instant. (ii) Consider 
three events: FB, when the front of the train is next to the back of the tunnel; FF, 
when the two fronts are coincident; and BB, when the two backs are coincident. 
(Sometimes it is useful to see the Lorentz contracted tunnel as rushing past the 
static train.) Assuming FB is at the origin in spacetime for both train and tunnel 
frames, what are the coordinates of FF and BB? (iii) What is the spacetime interval 
between FF and BB? (Which is a better frame for this calculation?) 


Exercise 7.10 A clock moving at velocity u = 2c passes me, sitting at my origin, 
at t = t’ = 0 according to it and my clock. (i) What is the clock’s location (in light- 
seconds) in my frame when it ticks 1 second in its frame? (ii) If it emits a light 
pulse at that time, at what time t according to me will that pulse reach my origin? 
Use (x, t) for me and (x’, t’) for the clock frame. 


Exercise 7.11 In my frame, event B occurs $4\,\mu$s after and 3km to the right of event 
A. (i) What is the velocity u of an observer for whom the events occur simultane- 
ously? (ii) Is the order of events reversible in some other frame? (iii) What is the 
spacetime interval between the events? (iv) What is the velocity of an observer for 
whom the events occur at the same point? 
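
The numbers in Exercise 7.11 can be laid out as follows, assuming the time separation is 4 microseconds and using the rounded c = 3.00 x 10^8 m/s. The variable names are my own.

```python
C = 3.0e8        # m/s, rounded (assumption)
dt = 4e-6        # s, B after A
dx = 3000.0      # m, B to the right of A

# Simultaneity: dt' = gamma (dt - u dx / c^2) = 0  =>  u = c^2 dt / dx
u_simultaneous = C**2 * dt / dx

# Spacetime interval (c dt)^2 - dx^2; negative means spacelike separation.
interval_sq = (C * dt) ** 2 - dx**2

# An observer who sees both events at one place would need speed dx / dt.
speed_same_place = dx / dt
print(u_simultaneous / C, interval_sq, speed_same_place / C)
```

A spacelike interval is exactly the case in which the time order can be reversed by a change of frame, and in which no observer moving slower than light can be present at both events.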


Exercise 7.12 A rocket leaves earth for a star 4 light-years away at u = $c. 
(i) When does it get there in the earth frame? (ii) In its own frame? (iii) What 
was the distance between the earth and the star according to the rocket observer? 
Hint: Use the light-year as the unit of distance and the year as a unit of time to 
avoid very large or small numbers. (iv) What are the unprimed coordinates of the 
rocket reaching the star in the earth frame? (v) What are the primed coordinates 
for this event in the rocket frame, obtained using the Lorentz transformation of 
the results from the last part? Explain x’ = 0. 


Exercise 7.13 A pair of twins leave for a star 24 light-years away at speeds 0.9c 
and 0.5c. What is their age difference when they reunite on earth? Hint: Carefully 
visualize the sequence of events. 


Exercise 7.14 An observer S who lives on the x-axis sees a flash of red light at 
x = 1210m. Then, after 4.95, a flash of blue at x = 480m. Use subscripts R and 
B to label the coordinates of the events. (i) What is the velocity relative to S of 
an observer $S_0$ who records the events as occurring at the same place? (ii) Which 
event occurs first according to $S_0$, and what is the measured time interval between 
these flashes? For the former you do not need to do a calculation. For the latter I 
suggest using the spacetime interval. 


Exercise 7.15 Do not skip! Needed for what follows. Write the 4-momentum 
$P = (E, pc)$ of a particle of mass m in terms of its rapidity defined by $v/c = \tanh\theta$. 


Exercise 7.16 (i) A rocket going at 5c relative to an inertial frame S launches 
another rocket going at ic relative to itself. This second rocket then launches a 


third rocket moving at $c relative to itself. What is the speed of the third rocket 
in the frame S? (ii) What are the rapidities of the first rocket, second rocket, and 
third rocket in S? (iii) What is $E/mc^2$ for the third rocket? 


Exercise 7.17 (i) Show that the Lorentz transformation may be written as 

$x_0' = x_0\cosh\theta - x_1\sinh\theta$ 

$x_1' = x_1\cosh\theta - x_0\sinh\theta$ 

if the primed frame has rapidity $\theta$ relative to the unprimed. (ii) If $(x_0'', x_1'')$ corresponds 
to a frame moving at rapidity $\theta'$ relative to the primed, express $(x_0'', x_1'')$ in 
terms of $(x_0, x_1)$, use the identities 14.19 and 14.20 in the book, and confirm that 
the rapidity of the doubly primed frame relative to the unprimed is $\theta + \theta'$. If you 
know matrices, relate the two-dimensional column vector X’ with components 
$(x_0', x_1')$ to those of X via a matrix $M(\theta)$. Next relate X’’ to X’ by a matrix $M(\theta')$. 
What is the matrix M that relates X’’ to X? Simplify the answer using identities 
14.19 and 14.20 in the book. 
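
The matrix form of Exercise 7.17 is easy to test numerically before proving it. The sketch below builds the 2 x 2 boost acting on $(x_0, x_1)$ and checks that two boosts in a row equal a single boost with the rapidities added. The rapidity values are arbitrary, and the helper names are mine.

```python
import math


def boost(theta):
    """2x2 Lorentz matrix acting on the column (x0, x1) for rapidity theta."""
    ch, sh = math.cosh(theta), math.sinh(theta)
    return [[ch, -sh], [-sh, ch]]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


theta, theta_prime = 0.3, 0.5                    # arbitrary rapidities
combined = matmul(boost(theta_prime), boost(theta))
expected = boost(theta + theta_prime)

rapidities_add = all(math.isclose(combined[i][j], expected[i][j])
                     for i in range(2) for j in range(2))
print(rapidities_add)
```

In terms of velocities, the same statement is the relativistic addition law, since $\tanh(\theta + \theta')$ is $(\tanh\theta + \tanh\theta')/(1 + \tanh\theta\tanh\theta')$.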


Exercise 7.18 A constant force F acts on a particle of mass m initially at rest. 
What is $\theta(t)$, its rapidity at time t? (Force is still the rate of change of the relativistic 
momentum p. Start with p in terms of $\theta$.) Give the answer in terms of an inverse 
hyperbolic function. Show that for large times we may approximate $\theta(t) \approx \ln\frac{2Ft}{mc}$. 


Exercise 7.19 What is the energy of a proton whose momentum is 4mc? 


Exercise 7.20 At what rapidity will an electron have the same energy as a proton 
at rest? (Approximate the relevant hyperbolic function for large $\theta$.)


Exercise 7.21 Two fragments in a stellar explosion are moving at velocities $\pm 0.8c$
according to some inertial observer. (i) What is the speed at which their separation 
is growing according to the observer? (ii) Why does this not violate the speed limit? 
(iii) What is the velocity of one fragment according to the other? 
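
A minimal Python sketch of the arithmetic behind parts (i) and (iii), assuming the two fragments move along one line at $+0.8c$ and $-0.8c$ in the observer's frame. The function name `compose` is my own label for the standard law of composition of collinear velocities.

```python
def compose(u, v):
    """Relativistic composition of collinear velocities, both in units of c."""
    return (u + v) / (1.0 + u * v)

v = 0.8                      # speed of each fragment in the observer's frame (units of c)

# (i) Rate at which the separation grows, as measured by the observer.
separation_rate = 2 * v

# (iii) Velocity of one fragment in the rest frame of the other.
relative_velocity = compose(v, v)

print(separation_rate, relative_velocity)
```

The first number exceeds 1 and the second does not; part (ii) asks you to explain why only the second is the velocity of a material object in some frame.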


Exercise 7.22 Two rockets of rest length $L_0$ are approaching the earth from opposite
directions at velocities +4c. How long does one of them appear to the
other?


Exercise 7.23 An excited nucleus of mass m* decays from rest into a lighter 
nucleus of mass m and a photon. What is the kinetic energy $(E - mc^2)$ of the
final nucleus? Do the calculation using 3-vector momentum and energy and then 
4-vectors. For the latter, isolate the photon 4-momentum. 


Exercise 7.24 A body of rest mass $m_0$ moving at speed $v$ collides with and sticks
to an identical body at rest. What is the mass $M$ and momentum $p$ of the final
clump?


Exercise 7.25 A body of rest mass $m_0$ moving at speed $v$ approaches an identical
body at rest. Find $V$, the speed of a frame in which the total momentum is
zero. Do this first by the law of composition of velocities, starting with how you
would do this non-relativistically. Next, repeat using the transformation law for
the components of the energy-momentum vector.


Exercise 7.26 A body quadruples its momentum when its speed doubles. What 
was the initial 7? 


Exercise 7.27 Let c = 1 in this exercise. Show that a photon cannot break up 
into an electron and a positron. For our purposes the electron and positron are 
identical particles with four-momenta (with $c = 1$) $P_1 = m(\gamma_1, \gamma_1 \mathbf{v}_1)$ and
$P_2 = m(\gamma_2, \gamma_2 \mathbf{v}_2)$, where $\gamma = 1/\sqrt{1 - v^2/c^2}$. The photon four-momentum is
$K = (\omega, \mathbf{k})$ with $\omega = |\mathbf{k}|$. I suggest you use energy-momentum conservation and the dot
products of four vectors to show that this process is kinematically forbidden. 


Exercise 7.28 Let $c = 1$ in this exercise. A particle of rest mass $m_0$ decays at rest
into a photon and loses rest mass $\delta$ in the bargain. Show that the photon energy is
$\omega = \delta\left(1 - \frac{\delta}{2m_0}\right)$.


Exercise 7.29 The kaon decays into two pions in its rest frame. (i) Given
$m_K = 3.567\,m_\pi$, find the energy and momentum of either pion in terms of $m_\pi$ and $c$.
(ii) What is the pions’ rapidity?


Exercise 7.30 Consider the reaction in which two protons collide and form three 
protons and an antiproton, as described in Section 15.1.2. (i) What is the mini- 
mum rapidity of the incoming proton in the lab frame? (ii) What is the minimum 
rapidity of the incoming protons in the CM frame? (iii) What is the relative 
rapidity Op in each frame? Explain the answer. 


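Exercise 7.31 A particle decays into three fragments with the following 4-momenta:
$P_1 = (E_1 = 3\,\text{GeV}, \mathbf{p}_1 = 2\mathbf{i}\,\text{GeV}/c)$, $P_2 = (E_2 = 4\,\text{GeV}, \mathbf{p}_2 = 3\mathbf{j}\,\text{GeV}/c)$,
and $P_3 = (E_3 = 5\,\text{GeV}, \mathbf{p}_3 = 4\mathbf{k}\,\text{GeV}/c)$. What is its rest mass $M$?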
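
As a sketch of the bookkeeping, the following Python snippet adds the three 4-momenta and takes the invariant length of the sum. It assumes the three momenta point along the unit vectors i, j, and k respectively, works in units where energies are in GeV and momenta in GeV/c, and uses variable names of my own choosing.

```python
import math

# Each fragment: (energy in GeV, momentum components in GeV/c along i, j, k)
fragments = [
    (3.0, (2.0, 0.0, 0.0)),
    (4.0, (0.0, 3.0, 0.0)),
    (5.0, (0.0, 0.0, 4.0)),
]

# Energy and momentum are conserved, so the parent's 4-momentum is the sum.
E_total = sum(E for E, _ in fragments)
p_total = [sum(p[i] for _, p in fragments) for i in range(3)]

# Invariant mass: (M c^2)^2 = E^2 - (p c)^2, giving M in GeV/c^2
M = math.sqrt(E_total ** 2 - sum(c * c for c in p_total))
print(M)
```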


Exercise 7.32 When a pion (mass 135 MeV/$c^2$) decays at rest into two photons,
what will be the energy and momentum of the two photons (in MeV and MeV/c)? 


PROBLEM SET 8, FOR CHAPTERS 16 AND 17 


Exercise 8.1 Visualize the number $z_1 = 1 + i$ in the complex plane. Try to figure
out in your head the phase and magnitude of $z_2 = i/z_1$ and confirm. Repeat for
Z3 = 1 y Zi. 


Exercise 8.2 Find the real part, imaginary part, modulus, complex conjugate,
344i 1+v2i 
zap (iv) ae 
(Do not use a calculator or Mathematica if you want to learn the subject. Leave 
the answers in terms of integers, 2, etc., if you want. Write as a + ib when 


appropriate. ) 


and inverse of the following numbers: (i) 5p (ii) (3+ 41)’, (iii) 


Exercise 8.3 For the following pair of numbers, give (i) their polar form, their 
complex conjugates, their moduli, (ii) their product, and their quotient z1 /z2: z1 = 


oR and 2 = V3 i. 


Exercise 8.4 Express the sum of the following in their polar form and decimal 
form: $z_1 = 2e^{i\pi/4}$ and $z_2 = 6e^{i\pi/3}$.


Exercise 8.5 Consider De Moivre’s theorem, which states that
$(\cos\theta + i\sin\theta)^n = \cos n\theta + i\sin n\theta$. This follows from taking the nth power of both sides of Euler’s
theorem. Use this to find the formula for $\cos 4\theta$ and $\sin 4\theta$ in terms of $\cos\theta$ and
$\sin\theta$. Given $e^{iA}e^{iB} = e^{i(A+B)}$, deduce $\cos(A + B)$ and $\sin(A + B)$.
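
Before expanding the fourth power by hand, you may like to convince yourself numerically that De Moivre's theorem holds. This is only a hedged illustration in Python with a function name (`de_moivre_gap`) that I made up; it does not produce the closed-form expressions the exercise asks for.

```python
import math

def de_moivre_gap(theta, n):
    """|(cos t + i sin t)^n - (cos nt + i sin nt)| for a given angle and power."""
    lhs = complex(math.cos(theta), math.sin(theta)) ** n
    rhs = complex(math.cos(n * theta), math.sin(n * theta))
    return abs(lhs - rhs)

# Sample a range of angles for the n = 4 case used in the exercise.
gaps = [de_moivre_gap(0.1 * k, 4) for k in range(20)]
print(max(gaps))
```

Once you have your formulas for $\cos 4\theta$ and $\sin 4\theta$, the same trick of comparing both sides at a few angles is a quick way to catch a sign error.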


Exercise 8.6 Verify the relation $\cos 2\theta = \cos^2\theta - \sin^2\theta$ to order $\theta^4$ by writing
down the left-hand side to order $\theta^4$ and multiplying out the power series for the
sin and cos on the right-hand side and grouping terms up to $\theta^4$.


Exercise 8.7 Expand up to terms of order x° the functions fi (x) = = and 
hax) = (i= T: 


Exercise 8.8 (i) Expand f(x) = 2% to order x*. (ii) Why will it not have an 


l+cosx 
xX term? 


Exercise 8.9 Expand to order x?: (i) sin(x + > (ü) e*—!, and (iii) In(2+ x). 


Exercise 8.10 The period of a simple pendulum of length $L$ is given by the exact
formula

$$T = 4\sqrt{\frac{L}{g}} \int_0^{\pi/2} \frac{d\theta}{\sqrt{1 - k^2\sin^2\theta}}$$

where $k = \sin\frac{\alpha}{2}$, and $\alpha$ is the maximum displacement from the vertical. Show that
in the limit $\alpha \to 0$, we regain the formula we learned in kindergarten. (i) Expand
the integrand to the next order in $k$ and find the modified $T$. (ii) What percentage
deviation does this correction produce for $\alpha = 30^\circ$?
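
To see what your series answer should be close to, here is a rough Python sketch that evaluates the exact integral numerically, assuming the standard form $T = 4\sqrt{L/g}\int_0^{\pi/2} d\theta/\sqrt{1 - k^2\sin^2\theta}$ with $k = \sin(\alpha/2)$. The function name `period_ratio`, the midpoint rule, and the number of steps are my own choices.

```python
import math

def period_ratio(alpha, steps=20000):
    """T / T0, where T0 = 2*pi*sqrt(L/g), for maximum displacement alpha (radians).

    Uses the midpoint rule on the integral of 1/sqrt(1 - k^2 sin^2(theta))
    from 0 to pi/2, with k = sin(alpha/2). L and g cancel in the ratio.
    """
    k = math.sin(alpha / 2.0)
    h = (math.pi / 2.0) / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        total += h / math.sqrt(1.0 - (k * math.sin(theta)) ** 2)
    return (2.0 / math.pi) * total

ratio_30 = period_ratio(math.radians(30.0))
print(ratio_30)
```

The percentage you find in part (ii) from the first correction in $k$ should come out slightly below the exact deviation computed this way, since higher-order terms are all positive.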


Exercise 8.11 A mass oscillates with some $\omega_0$ when connected to a spring. If the
spring is cut in half, what will be the new $\omega$? Hints: What will be the new force
constant? If the original spring extended by $\delta$ under some force, how much did
the half spring expand?


Exercise 8.12 Two identical masses are connected to springs with force constants 
k and 16k, displaced identically, and released. (i) If the shorter time period is 1s, 
when will they first return to their starting positions at the same time? (ii) How 
many oscillations would each have completed? 


Exercise 8.13 Two harmonic oscillators with periods $T_1 = 2$ s and $T_2 = 5$ s are
displaced by equal amounts and released. (i) After how long will they return
to their starting points simultaneously? (ii) How many cycles would each have
completed? (iii) What if the $T_1 = 2$ s oscillator had been displaced by double the
amount as the 5s one?


Exercise 8.14 Consider a particle of mass $m$ attached to a spring with force constant
$k$, executing the oscillation $x = A\sin(\omega t + \delta)$ with $A = 0.32$ m. At $t = 0$,
the particle is at $x = -0.07$ m with velocity $v(0) = -2$ m/s. The total energy is
5.6 J. Find (i) $\delta$ in radians, (ii) $f$ the frequency, (iii) $k$, and (iv) $m$. Hint: Since
$\sin\theta = \sin(\pi - \theta)$, you have two choices given just $x(0)$. You must pick the one
that gives a negative initial velocity, that is, with negative $\cos\theta$.
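
The hint about the two branches of the inverse sine is easy to get wrong, so here is a hedged Python sketch of the logic. It assumes the data read $A = 0.32$ m, $x(0) = -0.07$ m, $v(0) = -2$ m/s, and total energy 5.6 J, and that the energy is $\frac{1}{2}kA^2$; the variable names are mine.

```python
import math

A, x0, v0, energy = 0.32, -0.07, -2.0, 5.6   # m, m, m/s, J (assumed readings)

# asin returns the branch with cos > 0; the other solution pi - asin(...) has cos < 0.
principal = math.asin(x0 / A)
delta = math.pi - principal          # chosen so that v(0) = A*omega*cos(delta) < 0

omega = v0 / (A * math.cos(delta))   # from v(0) = A * omega * cos(delta)
f = omega / (2.0 * math.pi)          # frequency in Hz
k = 2.0 * energy / A ** 2            # from E = (1/2) k A^2
m = k / omega ** 2                   # from omega^2 = k / m

print(delta, f, k, m)
```

Had we kept the principal value instead, $\omega$ would have come out negative, which is the signal that the wrong branch was chosen.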


Exercise 8.15 Write down the equation of motion for a mass $m$ hanging from a
spring of force constant $k$. Show that $\bar{x}(t) = x(t) - x_0$, where $kx_0 = mg$, obeys
the equation for an oscillator at the same frequency $\omega_0 = \sqrt{k/m}$, in the absence of
gravity. Interpret the solution.


Exercise 8.16 An unknown mass attached to a vertical relaxed spring descends 
by 4cm before beginning to move back up in the course of its oscillations. Vi- 
sualize the oscillation, in particular the end points. (i) What is the amplitude A? 
(ii) When will there be zero net force on the mass? (iii) What is the time period of 
vibrations T? 


Exercise 8.17 A horizontal disk of radius $R$ is suspended at its center by a spring
with force constant $k$ and torsional constant $\kappa$. What is the ratio $\omega_l/\omega_r$ of its linear
and rotational frequencies of small oscillations?


Exercise 8.18 A mass m oscillates at 60Hz when hanging from a spring with 
force constant k. Adding 2kg to the mass changes the frequency to 30Hz. Find m 
and k. 


Exercise 8.19 A block of mass $m$ rests atop a block of mass $M$, and $\mu_s$ is their
mutual coefficient of friction. The lower block rests on a frictionless table and is
connected to the wall by a spring with force constant $k$. The spring is pulled by
an amount $A$ and released. At what moments in the oscillation is the upper mass
most vulnerable to slipping off? What is the condition on $\mu_s$ to ensure this does
not happen?


Exercise 8.20 Show that the critically damped solution of Eqn. 17.76 will never 
turn negative if it starts from rest with some positive x(0). 


Exercise 8.21 A steel beam of mass M and length L is suspended at its midpoint 
by a cable and executes torsional oscillations. If when two masses m are attached 
to either end of the beam the frequency is reduced by 10%, what is $m/M$?


Exercise 8.22 The bob of a pendulum of length 2m is moving at 2 m/s at its lowest
point. What is (i) its time period $T$, (ii) the angular frequency at this instant, and
(iii) the amplitude in degrees? (Assume $\sin\theta \simeq \theta$.)


Exercise 8.23 A rod of mass $M$ and length $L$ hangs vertically from the ceiling,
where it is supported by a pivot about which it is free to swing. Compute $\kappa$, the
restoring torque per unit angular displacement for small displacements, and show
that the frequency of small oscillations is $f = \frac{1}{2\pi}\sqrt{\frac{3g}{2L}}$. What is $f$ if it swings about
a point a distance $x$ above the CM?


Exercise 8.24 A V-shaped piece of wire, with an opening angle $\theta$ between its
two arms of length $L$, is supported at its vertex by a pivot. Show that for small
oscillations $\omega = \sqrt{\frac{3g\cos(\theta/2)}{2L}}$. (Consider the restoring torque per unit angle when the
wire is displaced from equilibrium.)


Exercise 8.25 A rod of length L rests horizontally in a vertical frictionless circle 
of radius R, forming a chord subtending an angle of 120° at the center. Show 
that the frequency of small oscillations about the rod’s equilibrium (horizontal)
position is that of a simple pendulum of length $R$. (Find $\kappa$, the torque per unit
angular displacement about the center of the circle and the corresponding moment
of inertia. Does the force of the frictionless circle on the rod contribute to this 
torque?) 


Exercise 8.26 A thin disk of mass $M$ and radius $R$ is balanced on a table on its
rim. A tiny mass $m$ is glued on the circumference just above the point of contact
with the table. When the disk is given a small angular displacement and released,
it rolls back and forth in the vertical plane without slipping. Show that the period
of oscillations will be $T = 2\pi\sqrt{\frac{3MR}{2mg}}$. Here we ignore the moment of inertia of $m$.
Why?


Exercise 8.27 Recall the discussion of the physical pendulum depicted in its
equilibrium configuration in the lower left part of Figure 17.3. Let the period be
$T$ when the supporting nail $P$ is a distance $l_1$ vertically above the CM. It turns out
that there is a second pivot point a distance $l_2$ on the opposite side of the CM (i.e.,
$l_1 + l_2$ below $P$ in the figure) such that if supported there, the same $T$ will result.
First show that the equality of time periods implies $I_{CM} = M l_1 l_2$. Then show that
$l_1 + l_2 = gT^2/(4\pi^2)$. Make liberal use of the parallel axis theorem.


Exercise 8.28 Consider a pendulum where the bob is a sphere of radius $r$ and the
supporting string has length $L \gg r$, which allows us to treat it as a simple pendulum.
Show that $T = T_0\sqrt{1 + \frac{2r^2}{5L^2}}$, where $T_0$ is the period of a simple pendulum
of the same length.


Exercise 8.29 I am driving my car on a parkway which has bumps every 30m 
apart. At what speed must I be driving to experience violent shaking if the 
suspension in my car has a resonant frequency of 0.5Hz? 


Exercise 8.30 A 2kg mass is connected to a spring with $k = 400$ N/m. The value of
the damping coefficient is $\gamma = 1\,\text{s}^{-1}$. It is driven by a force $F(t) = 10\cos(10t)$ N.
What is (i) the amplitude at this $\omega$, (ii) the resonant frequency $\omega_0$, and (iii) the
amplitude at resonance?


Exercise 8.31 (i) Show that a driven oscillator has its maximum amplitude of
vibration at a frequency $\omega = \sqrt{\omega_0^2 - \gamma^2/2}$. Hint: Extremize $|Z(\omega)|^2$ instead of
$|Z(\omega)|$ after explaining why. (ii) At what frequency does the velocity have the
greatest amplitude?


Exercise 8.32 For a damped oscillator (not driven by any external force) find the 
time T* after which the amplitude of oscillations drops to half its value in terms 
of $\gamma$ and $m$.


Exercise 8.33 Find the particular solution when the oscillator defined by Eqn. 
17.77 in Chapter 17 has $\gamma = 1$ and $\omega = 4$ and is driven by $F(t) = 2m\cos^2 t$.


Exercise 8.34 Consider a damped oscillator with $k = 32$ N/m, $m = 0.5$ kg, and
$\gamma = 2\,\text{s}^{-1}$. (i) Find the solution with $x(0) = 2$, $v(0) = 0$, using symbols and substituting
numbers only at the end. Note: Since $\tan\theta = \tan(\pi + \theta)$, there will be two choices
for the phase $\phi$ given $v(0) = 0$. Pick the one that allows $x(0) = 2$ with positive
amplitude $A$. (ii) Find the particular solution $x_p$ due to a driving force $F_0\cos\omega t$
with $F_0 = 10$ N and ω = 2a. Pick the $\phi$ that gives the positive value for $v(0)$. (iii)
Give the values of $x(0)$ and $v(0)$ of the full solution (complementary + particular)
such that there will be no need to bring in the complementary function.


PROBLEM SET 9, FOR CHAPTERS 18 AND 19 


Exercise 9.1 The human ear can hear sounds from 20Hz to 20,000Hz. What are 
the corresponding wavelengths? Assume a sound velocity $v_s = 340$ m/s.


Exercise 9.2 (i) Write an equation in MKS units for a wave moving along $+x$
with amplitude 0.4m, speed 6 m/s, and frequency 17Hz. (ii) If these are waves on a
string with mass per unit length $\mu = 0.02$ kg/m, what is $u$, the energy per unit length?
(iii) What is the average power being fed into the vibrating string?


Exercise 9.3 The speed of sound in water and air is 1450 m/s and 330 m/s, respectively.
Sound from an explosion on the surface of a lake first reaches me when my
head is underwater and 5s later when my head is above the water. How far away 
was the explosion? 
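
The setup amounts to one equation in one unknown: the same distance is covered at two speeds, and the arrival times differ by 5 s. A minimal Python sketch, with variable names of my choosing and the speeds taken in m/s:

```python
v_water = 1450.0   # m/s, speed of sound in water
v_air = 330.0      # m/s, speed of sound in air
delay = 5.0        # s, lag between the two arrivals

# d / v_air - d / v_water = delay, solved for d
distance = delay / (1.0 / v_air - 1.0 / v_water)
print(distance)    # in metres
```

Working with symbols first, as above, makes it obvious that the answer blows up as the two speeds approach each other, which is a useful check on the formula.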


Exercise 9.4 At the beach you see the crests of the waves are 3m apart and 8 of 
them pass you in 12s. What is the wave velocity? 


Exercise 9.5 Consider the wave $y(x,t) = 0.5\cos(100\pi x - 60\pi t)$ on a string.
What is the (i) amplitude, (ii) frequency, (iii) maximum velocity of a point, and 
(iv) maximum acceleration? 


Exercise 9.6 Find the maximum acceleration of points on a string supporting a 
standing wave with $v = 16$ m/s, a spacing of 2m between nodes, and an amplitude
of displacement of 2cm. 


Exercise 9.7 An infinite string carries waves of v = 50 m/s, at f = 200Hz, A=


2.4cm, and has u = 3 x 107° a What is (i) (u), the average energy density and 
(ii) (P), the average power? 


Exercise 9.8 A wire has a tension equal to its weight. What is the velocity of 
waves? 


Exercise 9.9 A mass M hangs from a rope of length L and mass m << M. If the 
lower end is given a transverse flick, how long will it take to reach the top? Do the 
numbers for L = 8m, m= 50g, M = 5kg. 




Exercise 9.10 A string is anchored on a wall at one end while the other end goes
over a pulley and supports a weight $W_1$. It has no nodes (except at the ends).
When $W_2$ is attached instead of $W_1$, there is one node but the frequency is the same
as before. What is $W_1/W_2$?


Exercise 9.11 A wire supports a fundamental frequency $f_0$. Consider a wire
made of the same material but of half the length, of double the diameter, and 
at 4 times the tension. What will be the new fundamental frequency fo? 


Exercise 9.12 A mass M hangs vertically at the end of a cable of mass m and 
length L. (i) How long will a transverse pulse take to travel from bottom to top if 
you ignore m, the cable mass? (ii) Now repeat, including m and remembering that 
the velocity of the signal varies with the distance from the bottom end. Show that 
the answer reduces to part (i) as $m \to 0$. (Do you see why you cannot set $m = 0$?)


Exercise 9.13 A block of mass M sits on a frictionless inclined plane of angle 
a = 4. It is connected by a wire of linear mass density u = 0348 that goes over a 


pulley and supports a hanging mass m. Both masses are at rest. If transverse waves 
travel at $v = 80$ m/s in the wire, find $M$ and $m$, using symbols till the end. Ignore
the mass of the wire in computing the tension on the wire; use it just to find the 
velocity of waves. 


Exercise 9.14 What is the ratio of sound intensities which differ by 1 dB? 


Exercise 9.15 A source of sound radiates uniformly in all directions. Along a ra- 
dial line from the source locate two points separated by 2m such that the intensity 
at the nearer point is 4 dB above that of the more distant point. 


Exercise 9.16 An isotropic source of sound has an intensity of 80 dB at a distance 
r= 10m. (i) What is the total power output P? (ii) At what r does the intensity fall 
to 60 dB? 


Exercise 9.17 I have anchored a massless speaker emitting sound at f = 600Hz 
on top of a mass connected to a spring. The mass-spring system oscillates horizontally
at 4Hz with amplitude A. Given $v_s = 330$ m/s, and that the difference between
the highest and lowest frequencies I hear is 2Hz, what is A? 


Exercise 9.18 A source of sound at 600Hz is placed at the edge of a horizontal 
disk of radius 1m rotating at 3“*. What is the range of frequencies heard by a 
listener on the same plane as the disk? Assume $v_s = 340$ m/s.


Exercise 9.19 A microphone is executing vertical oscillations above a source
emitting sound at $f_0 = 400$ Hz. The frequencies it picks up have a variation
$\delta f = 2$ Hz. (i) What is its maximum velocity $v_m$? (Assume the velocity of sound
is $v_s = 340$ m/s.) (ii) What is $A$, the amplitude of vibrations, if it has a period of
$T = 4$ s? First get the formula and then plug in numbers.


Exercise 9.20 The ear canal is about 3cm long and can be viewed as a tube open 
at one end and closed at the other. Relate this to the fact that we seem to hear best 
at around 3,000Hz. 


Exercise 9.21 Longitudinal waves on a metal rod travel at 3,450 m/s. (i) Find two
of the lowest standing wave frequencies on a rod of length 2m clamped at one end 
and free at the other. (ii) Repeat if the rod is clamped at both ends. 


Exercise 9.22 How many antinodes are there on a string 80cm long, vibrating at 
1,600Hz and assuming a wave velocity on the string of 320 m/s?


Exercise 9.23 A tube is 20cm long. What is the fundamental frequency if it is 
(i) closed at both ends, (ii) open at both ends, and (iii) open at one end and closed 
at the other? Assume $v_\text{sound} = 330$ m/s.
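
The three cases reduce to asking what fraction of a wavelength fits in the tube. The sketch below, in Python with names of my own, assumes the sound speed is 330 m/s and uses the standard results that like boundary conditions at the two ends fit half a wavelength and unlike ones fit a quarter.

```python
v_sound = 330.0    # m/s, assumed speed of sound
length = 0.20      # m, length of the tube

# (i) closed-closed and (ii) open-open: half a wavelength fits, lambda = 2L
f_like_ends = v_sound / (2.0 * length)

# (iii) open at one end, closed at the other: quarter wavelength, lambda = 4L
f_unlike_ends = v_sound / (4.0 * length)

print(f_like_ends, f_unlike_ends)   # both in Hz
```

Draw the displacement pattern for each case to see why (i) and (ii) share a frequency even though nodes and antinodes are interchanged.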


Exercise 9.24 A string at tension 600N, of mass per unit length $\mu = 5 \times 10^{-4}$ kg/m,
supports a standing wave of 640 Hz with three nodes in addition to the ones at the
ends. Draw a picture. What is (i) the wave velocity $v$, (ii) the length $L$ of the string,
(iii) the wavelength $\lambda$, and (iv) the equation for the wave if it has an amplitude of
4cm?


Exercise 9.25 A string of length L = 50cm and u = 107° ke is found to resonate 
only at 500Hz and 750Hz when stimulated in the range 400-800 Hz. (i) From the 
ratio of frequencies deduce the modes and the tension T on the string. (ii) Why 
can we rule out these being the 4th and 6th harmonics? 


Exercise 9.26 A block of mass $m$ is on an inclined plane of angle $\theta$. It is held on
the slope by a string of length $L$, with mass per unit length $\mu$. We want the string
to support vibrations at some $F_0$. (i) Write an expression for the allowed angles
$\theta$ that ensure this and describe the mode of vibration. (ii) Provide allowed angles
above $10^\circ$ for $M = 10$ kg, $\mu = 0.02$, $F_0 = 200$ Hz, $L = 0.4$ m.


Exercise 9.27 Two speakers emitting sound at 550Hz are at $(x = 0, y = \pm 0.75)$.
The first destructive interference takes place at (4m, 0.8m). What is the velocity of
sound? (i) Do this using Pythagoras’ theorem to calculate distances exactly and
(ii) compare to the large-distance approximation $d\sin\theta = (n + \frac{1}{2})\lambda$, valid when
the distance between the sources is much smaller than the distance to the point
where the waves are added.


Exercise 9.28 Two speakers emitting sound at 660Hz are 2.0m apart. Find the 
locations in the line joining them where the resultant sound is (i) a maximum and 
(ii) a minimum. Assume $v_s = 330$ m/s.


Exercise 9.29 Two sources of sound waves at 480Hz produce interference maxima
at $0^\circ$ and $\pm 23^\circ$. Assuming $v_s = 340$ m/s, find (i) $d$, the distance between the
sources and (ii) the angle for the next maximum. (iii) Why are there no more 
maxima? (iv) Where are the minima? You may use Eqns. 19.30 through 19.32. 


Exercise 9.30 Two waves $y_\pm = 10\sin(0.5\pi x \pm 40\pi t)$ interfere to form a standing
wave $y(x,t)$. What is (i) $y(x,t)$, (ii) the distance between nodes, (iii) the velocity
and acceleration at a point $x = 1$?


Exercise 9.31 I am at my yacht on the beach which runs along the y-axis. A wall 
has been built at x = 100m parallel to it so as to keep waves with $\lambda = 10$ m from
rocking my yacht. A hole now develops at the point on the wall directly in front 
of my yacht, that is, (x = 100, y = 0). Where should I make a second hole nearest 
to this hole so as to have destructive interference at my yacht? Use Pythagoras’s 
theorem to compute distances. Of the two choices show that if I choose the one at 
y > 0, I can kill two birds with one stone by producing constructive interference 
at my neighbor’s yacht located at (0, 49.3). 


PROBLEM SET 10, FOR CHAPTER 20 


Exercise 10.1 A vertical tube of radius 1cm, open at the top to the atmosphere, 
contains 2cm of oil ($\rho_\text{oil} = 0.82\,\rho_\text{water}$) floating on 3cm of water. What is the gauge
pressure (i.e., pressure in excess of the atmospheric) at the bottom? 
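
Since the tube's radius drops out, the gauge pressure is just the sum of $\rho g h$ over the two layers. A short Python sketch, assuming $\rho_\text{water} = 1000$ kg/m$^3$ and $g = 9.8$ m/s$^2$ (neither value is stated in the exercise) and using names I picked:

```python
g = 9.8                  # m/s^2, assumed
rho_water = 1000.0       # kg/m^3, assumed
rho_oil = 0.82 * rho_water

h_oil = 0.02             # m of oil on top
h_water = 0.03           # m of water below

# Each layer adds rho * g * h; atmospheric pressure is excluded by definition.
gauge_pressure = g * (rho_oil * h_oil + rho_water * h_water)
print(gauge_pressure)    # in Pa
```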


Exercise 10.2 A person wants to suck water through a straw 120cm tall. What is 
the minimum pressure difference between the atmosphere and the inside of the 
person’s mouth? 


Exercise 10.3 A wide can of water is filled to height h. At what height from the 
base should I drill a hole to (i) get the maximum range for the jet that comes 
out and (ii) ensure that the jet travels a horizontal distance equal to the vertical 
distance? (iii) What is the maximum horizontal distance the jet can travel? Neglect 
the velocity of water at the top of the can. 


Exercise 10.4 A rectangular hole is made on the side of a cylindrical container
of water. The hole has width $w$, and its upper and lower edges (parallel to the
base of the cylinder) are at depths $d_1$ and $d_2$, respectively, measured from the water
level. (i) Show using integration that the initial flow rate out of this hole is
$\frac{2w}{3}\sqrt{2g}\left[d_2^{3/2} - d_1^{3/2}\right]\ \text{m}^3/\text{s}$, neglecting the motion of water at the top of the tank.
(ii) Suppose $d_2 = d_1 + \delta$, where $\delta$ is very small, so that we can ignore the variation
of the depth from the top to the bottom of the hole. Compute the flow rate
and show that the exact answer above reduces to this result in the same limit.
(Remember $(1 + x)^n = 1 + nx + \dots$.)
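
The limit in part (ii) can be watched numerically. The Python sketch below assumes the exact rate has the form $\frac{2w}{3}\sqrt{2g}\,(d_2^{3/2} - d_1^{3/2})$, which is what integrating $w\sqrt{2gy}\,dy$ over the hole gives, and compares it with the thin-slit estimate $w\,\delta\sqrt{2gd_1}$. The function names, the sample dimensions, and $g = 9.8$ m/s$^2$ are my assumptions.

```python
import math

G = 9.8  # m/s^2, assumed value of g

def exact_rate(w, d1, d2, g=G):
    """Flow rate from integrating w*sqrt(2*g*y) dy between depths d1 and d2."""
    return (2.0 * w / 3.0) * math.sqrt(2.0 * g) * (d2 ** 1.5 - d1 ** 1.5)

def thin_slit_rate(w, d1, delta, g=G):
    """Flow rate when the whole slit of height delta is treated as at depth d1."""
    return w * delta * math.sqrt(2.0 * g * d1)

w, d1 = 0.10, 0.50                      # illustrative width and depth in metres
for delta in (1e-1, 1e-2, 1e-3):
    ratio = exact_rate(w, d1, d1 + delta) / thin_slit_rate(w, d1, delta)
    print(delta, ratio)                 # ratio should approach 1 as delta shrinks
```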


Exercise 10.5 A body of density $\rho$ is dropped from a height $H$ into a lake where
the water density is $\rho_w > \rho$. (i) To what depth $d$ will it sink before beginning to
rise up? (ii) When it floats, what fraction $f$ of its volume will be submerged?


Exercise 10.6 A piece of cork of density $\rho = 0.22\,\rho_\text{water}$ is held down fully submerged
in water by a thread anchored at the base of the container. What is $T/mg$,
where $T$ is the tension of the thread and $m$ is the mass of the piece of cork?


Exercise 10.7 A cylinder of cross section A and mass m floats vertically in a 
fluid of density $\rho$ (in other words, the cross section is parallel to the surface).
If disturbed from this equilibrium, what will be the period of small (vertical) 
oscillations? 


Exercise 10.8 A horizontal pipe of diameter $d_1 = 10$ cm carrying water has a constriction
of diameter $d_2 = 4$ cm. If $P_1 = 10^5$ Pascals and $P_2 = 8 \times 10^4$ Pascals, what
is the flow rate?


Exercise 10.9 Find the lift on an airplane wing of area 50 m$^2$ if the velocity of air
at the upper and lower parts is 90 m/s and 70 m/s, respectively. Assume $\rho_\text{air} = 1.16$ kg/m$^3$.


Exercise 10.10 A wall of an outdoor pool is 10m long and 2m deep. What force 
is needed to keep it in place? 


Exercise 10.11 A balloon of mass $M = 4$ kg has a volume $V = 18$ m$^3$ and is filled
with $m = 3.2$ kg of He. Assuming $\rho_\text{air} = 1.3$ kg/m$^3$, compute the (i) buoyancy force
and (ii) acceleration of the balloon.


Exercise 10.12 A chandelier weighing 980N is supported by a suction cup 
(which has no air in it). What should be its minimum area to support the 
chandelier? 


Exercise 10.13 You know that in an accelerated frame of reference, there will be
a force $-m\mathbf{a}$ acting on every mass $m$. Use this to show that when a beaker of water
is accelerated at $\mathbf{a} = \mathbf{i}a$, the surface of the water will make an angle $\theta = \tan^{-1}(a/g)$
with respect to the x-axis.


Exercise 10.14 Consider the hydraulic press shown in Figure 20.5 in Chapter 20. 
(i) If the areas of the pistons are 1 m$^2$ and 2 m$^2$, what force is required in the smaller
piston to support a 1,000kg load on the bigger one? (ii) How far must the smaller
piston be moved to lift the 1,000kg by 2cm? (iii) What is the ratio of work done in
the two sides? 


Exercise 10.15 Consider a U-tube with both limbs open to the atmosphere with
equal heights of mercury ($\rho_\text{Hg} = 13.6 \times 10^3$ kg/m$^3$). When the air ($\rho_a = 1$ kg/m$^3$) at speed
$v$ begins to flow above just one limb, the mercury there rises by 4 cm above the old
level. What is $v$? (Remember, one limb’s gain is the other limb’s loss.)


Exercise 10.16 Consider water spinning in a bucket with angular velocity $\omega$. Let
us prove that the surface will assume a parabolic shape. Go to the rotating frame, 
where in addition to gravity there is a radially outward centrifugal force. Find the 
direction of the vector sum of that force and the force of gravity. This sum will 
be the “local force of gravity,” and the surface of water will be perpendicular to 
it. From the tangent of the angle the vector sum makes with the x-axis, infer that 
$\frac{dy}{dx} = Cx$, where $C$ is a constant. (i) What is $C$? (ii) Integrate this relation to show
that y(x) is parabolic. 


Exercise 10.17 Fluid moving at 60% courses through a tube of diameter 4mm. 
The tube then branches into three identical tubes of diameter d in which the fluid 
flows at 80". What is d? 


Exercise 10.18 That pressure is transmitted uniformly was dramatically illus- 
trated by Pascal (after whom the unit of pressure is named). Imagine a water-filled 
barrel with a lid of radius R and a tube protruding from it filled with water to some 
height h. (i) What is the pressure due to the water column? (ii) What is the force 
on the lid? Crunch the numbers for h = 12m and R= 0.5m. Apparently upon 
adding that last drop of water, the barrel exploded. 


Exercise 10.19 A piece of cork (density $\rho_c$) of weight 1N when immersed in
water (density $\rho_w = 10^3$ kg/m$^3$) has to be held down by a force of 3N. What is $\rho_w/\rho_c$?
Hint: Assume a volume $V$ for the cork.


Exercise 10.20 A balloon of volume $V$, mass 2kg, and filled with He
($\rho_\text{He} = 0.18$ kg/m$^3$) can carry a load of 1,000N. What is $V$? ($\rho_\text{air} = 1.29$ kg/m$^3$)


PROBLEM SET 11, FOR CHAPTERS 21, 22, 23, AND 24


Exercise 11.1 How much heat is needed to convert 1kg of ice at —10°C to steam 
at 100° C? Remember ice and water do not have the same specific heat. 


Exercise 11.2 A lead bullet of mass $m$ and specific heat $c = 130$ J/(kg·K) moving at
$v_0$ gets embedded in a lead block of mass $M$. Assuming the loss of kinetic energy
goes to heating, what is $\Delta T$, the rise in temperature of the final object? Do the
numbers for $m = 5$ g, $M = 100$ g, $v_0 = 200$ m/s.


Exercise 11.3 I mix 500g of water at 20°C with 200g of ice at 0° C. What is the 
end product? 


Exercise 11.4 I mix 20g of steam at 100°C with 150g of ice at 0° C. What is the 
end product? 


Exercise 11.5 A snowball falls to earth at $v = 100$ m/s. What fraction of it will melt
if the kinetic energy is converted to heat? 


Exercise 11.6 If 400g of ice at —2°C is placed in 1kg of water at 21° C, what is 
the end product when equilibrium is reached? 


Exercise 11.7 To find $c_X$, the specific heat of material X, I place 75g of it in a 30g
copper calorimeter that contains 65g of water, all initially at 20°C. When I add
100g of water at 80°C, the final temperature is 49°C. What is $c_X$?


Exercise 11.8 Two slabs L and R of the same area and thickness are glued together.
The two external faces are held at $T_L$ and $T_R$, respectively. If $\kappa_R = 3\kappa_L$, what
is $T_I$, the temperature of the interface? Do the numbers for $T_L = 0^\circ$, $T_R = 80^\circ$.
Does this remind you of something in DC circuits?


Exercise 11.9 A wall 8m x 20m has a thickness of 15cm. Given k = 0.8, find 
the energy flowing across in a day if the temperature difference is 15°C. 


Exercise 11.10 A copper rod ($\kappa = 401$ W/(m·K)) of length 50cm and radius 2cm has
one end dipped in an ice-water mixture and the other in boiling water. What is
the heat flow $\frac{dQ}{dt}$?
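
For a uniform rod in steady state the heat flow is $\kappa A\,\Delta T/L$. The Python sketch below assumes the conductivity is meant to be 401 W/(m·K), the commonly quoted value for copper, and that the two baths sit at 0°C and 100°C; the variable names are mine.

```python
import math

kappa = 401.0        # W/(m K), assumed thermal conductivity of copper
length = 0.50        # m
radius = 0.02        # m
delta_T = 100.0      # K, boiling water minus ice-water mixture (assumed 100 and 0 Celsius)

area = math.pi * radius ** 2                 # cross-sectional area of the rod
heat_flow = kappa * area * delta_T / length  # steady-state conduction, in watts
print(heat_flow)
```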


Exercise 11.11 How much heat flows out per second through a concrete
($\kappa = 1$ W/(m·K)) roof of area 100 m$^2$ and thickness 20cm if the outside is at 0°C and the
inside is at 17°C?


Exercise 11.12 Consider a hollow sphere of inner and outer radii $r_1$ and $r_2$ maintained
at $T_1$ and $T_2$, respectively. What is the rate of flow from the inner to the
outer surface in terms of $\kappa$? (i) First assume $\Delta r = r_2 - r_1$ is so small that we can
treat the two faces as parallel surfaces and apply the appropriate formula. Show
that in this case $\frac{dQ}{dt} = -\kappa (T_2 - T_1)\frac{4\pi r_1^2}{\Delta r}$. (The minus sign means that if the outer
surface is hotter, heat flows radially inward.) (ii) Now consider more carefully a
sphere of radius $r_1 < r < r_2$. The flow rate there is $\frac{dQ}{dt}(r) = -4\pi r^2 \kappa \frac{dT}{dr}$. But this
must be an $r$-independent constant $-C$ in equilibrium (so there is not a pile up
of heat). Start with $\frac{dT}{dr}$ written in terms of $C$ and integrate between $r_1$ and $r_2$ and
use the data at $r_1$ and $r_2$ to show $\frac{dQ}{dt} = -C = -4\pi\kappa (T_2 - T_1)\frac{r_1 r_2}{r_2 - r_1}$.


Exercise 11.13 How many moles of ideal gas are there in a room of volume 50 m$^3$
at atmospheric pressure and 300K? 
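
This is a one-line application of $PV = nRT$. A Python sketch, assuming the room's volume is 50 m$^3$, one atmosphere is 101,325 Pa, and $R = 8.314$ J/(mol·K):

```python
P = 101325.0     # Pa, one standard atmosphere (assumed value)
V = 50.0         # m^3, volume of the room
T = 300.0        # K
R = 8.314        # J/(mol K), gas constant

n_moles = P * V / (R * T)
print(n_moles)
```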


Exercise 11.14 A spherical air bubble of radius 2cm is released 30m below the 
surface of a pond at 280K. What is its volume when it reaches the surface, which 
is at 300K, assuming it is in thermal equilibrium the whole time? Ignore the size 
of the bubble compared to other dimensions like 30m. 


Exercise 11.15 What is the volume of one mole of an ideal gas at STP, Standard 
Temperature (273K) and Pressure (1 atmosphere)? 


Exercise 11.16 One mole of ideal nitrogen gas is at 2 atmospheres and occupies 
a volume of 10m?. Find (i) T in Kelvin, (ii) U the internal energy (assumed to be 
just kinetic energy) per molecule in joules, and (iii) the typical velocity of the gas 
molecules, which have a mass $4.65 \times 10^{-26}$ kg.


Exercise 11.17 An ideal mono-atomic gas in state A ($P = 3 \times 10^5$ Pa,
$V = 10^{-3}$ m$^3$, $U = 450$ J) is cooled at constant volume to B ($P = 2 \times 10^5$ Pa,
$V = 10^{-3}$ m$^3$) and then expands at constant pressure to C ($P = 2 \times 10^5$ Pa,
$V = 3 \times 10^{-3}$ m$^3$). (Nothing has been said about the path CA. So don’t draw it.) What is
(i) the work done, (ii) $U_C$, (iii) $\Delta Q$, the heat added, (iv) the ratio of initial and
final temperatures, $T_C/T_A$, and (v) the ratio of internal energies $U_C/U_A$?
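
The first-law bookkeeping for the path A to B to C can be sketched as follows in Python. I am assuming the state data read $P_A = 3\times10^5$ Pa, $P_B = P_C = 2\times10^5$ Pa, $V_A = V_B = 10^{-3}$ m$^3$, and $V_C = 3\times10^{-3}$ m$^3$, which is consistent with $U_A = \frac{3}{2}P_AV_A = 450$ J for a mono-atomic gas; the names are mine.

```python
P_A, V_A, U_A = 3e5, 1e-3, 450.0    # Pa, m^3, J
P_B, V_B = 2e5, 1e-3                # after cooling at constant volume
P_C, V_C = 2e5, 3e-3                # after expanding at constant pressure

def internal_energy(P, V):
    """U = (3/2) P V for an ideal mono-atomic gas."""
    return 1.5 * P * V

work = P_B * (V_C - V_B)            # only the constant-pressure leg does work
U_C = internal_energy(P_C, V_C)
heat_added = (U_C - U_A) + work     # first law: Delta Q = Delta U + W
temp_ratio = (P_C * V_C) / (P_A * V_A)   # T is proportional to P V at fixed n
energy_ratio = U_C / U_A

print(work, U_C, heat_added, temp_ratio, energy_ratio)
```

Note that the heat added is a net figure: heat leaves on the leg AB and enters on BC.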


Exercise 11.18 A Stirling engine using an ideal mono-atomic gas follows the
cycle shown in Figure E-5, bounded by isotherms at $T_h$ and $T_c$ and two vertical
lines at volume $V_1$ and $V_2$. (i) Find its efficiency, defined as

$$\eta = \frac{\text{work output}}{\text{heat in}}$$

as a function of $T_h$, $T_c$, $V_1$, $V_2$, and $R$. Before rushing in, make sure you know
for each segment if any heat is absorbed or released. (ii) Show that $\eta < \eta_C$
of a Carnot engine operating between $T_h$ and $T_c$. (Consider $1/\eta$.)


Figure E-5 Stirling cycle for Exercises 11.18 and 11.19


Exercise 11.19 Consider the Otto cycle, which looks like the Stirling cycle of
Figure E-5, except the lines 12 and 34 are adiabatic curves. Show that 7 = 1 − x = F
and express it in terms of the compression ratio $r = V_2/V_1 = V_3/V_4$.


Exercise 11.20 A heat engine goes through the rectangular loop 1 → 2 → 3 →
4 → 1 with corners 1: (1Pa; 1L); 2: (4Pa; 1L); 3: (4Pa; 4L); 4: (1Pa; 4L). In
addition, $T_1 = 100$K and $\gamma = c_p/c_v = 1.4$ where $c_p$ and $c_v$ are specific
heats per mole. What is the efficiency of the engine? Hints: Assume only that
$c_p = c_v + R$, deduce $c_v$ in terms of $nR$, and use the ideal gas law repeatedly.


Exercise 11.21 An ideal gas at P = 200kPa has a volume of 4L. It expands 
isothermally until P = 100kPa. Find (i) W the work done and (ii) Q the heat 
added. 


Exercise 11.22 What is the final $T$ of an ideal gas initially at 20°C if compressed
adiabatically to half its original volume? Assume $c_v = \frac{5}{2}R$.


Exercise 11.23 A cylinder with a piston of cross section $0.04\,\mathrm{m^2}$ contains an ideal
gas at T = 273K and atmospheric pressure. When a mass 10kg is placed on the top 
and lowered slowly, keeping the temperature constant, the gas acquires a smaller 
volume. (i) What is the percent drop in height? (ii) If now it is heated to bring the 
piston back to the top, what is the increase in T? 


Exercise 11.24 (i) What is the average speed of an O$_2$ molecule (32g/mole) at
300K? (ii) Repeat for an H$_2$ molecule (2g/mole). (iii) Compare to the speed of
sound, 340 m/s. Hint: Multiply both sides of Eqn. 22.8 in Chapter 22 by $N_A$, which
converts $k$ to $R$ and $m$ to $M$, the mass of one mole.

Exercise 11.25 Consider the Maxwell-Boltzmann distribution, Eqn. 22.19.
(i) Using the result $\int_0^\infty x^2 e^{-x^2}\,dx = \frac{\sqrt{\pi}}{4}$, normalize the
distribution so that $\int_0^\infty P(v)\,dv = 1$. (ii) For a gas of $N$ atoms, apply
$dN \simeq N \cdot P(v)\,dv$ for $dv \ll 1$ to find the number of carbon molecules
($m = 1.99 \times 10^{-26}$kg) with 1,000 m/s $< v <$ 1,002 m/s, when $N = 10^6$
and $T = 300$K.


Exercise 11.26 If 20g of He are in a volume V = 6 x 107° m? at 300K, what is P? 


[P-V diagrams (a) and (b): points A, B, and C; P axis marked at 50kPa; V axis
marked at 5L and 25L.]
Figure E-6 (a) Exercise 11.27 (b) Exercise 11.28


Exercise 11.27 A gas goes over the cycle ABCA as in Figure E-6(a) where AC is 
an isotherm and AB is an isobar. (Note that L stands for liter, with 1L = $10^{-3}\,\mathrm{m^3}$.)
(i) Find the (P, V) coordinates of C. (ii) What is the work done in each part of the 
cycle and (iii) the heat absorbed or rejected in the full cycle? 


Exercise 11.28 One mole of a gas with $\gamma = \frac{4}{3}$ goes over the cycle ABCA as in
Figure E-6(b) where one of AB or AC is isothermal and the other adiabatic. (You 
figure out which.) (i) Write down the (P, V, T) coordinates of A, B, and C (some 
of which are already given). (ii) What is the work done in each part of the cycle 
and the heat absorbed or rejected in the full cycle? 


Exercise 11.29 What is the increase in entropy if one gram of ice at 0°C is melted
and heated to 50°C? 


Exercise 11.30 Find the change in entropy if 500g of water at 80°C is added to 
300g of water at 20°C. 


Exercise 11.31 One mole of an ideal mono-atomic gas, thermally insulated, is
restricted by a partition to a volume $V_0$ of a cylinder whose total volume is $3V_0$.
Its pressure is $P_0$. (i) When the partition is removed and the gas fills the whole box,
what is the change in entropy of the gas and of the universe? (ii) What is the ratio of
microscopic states available after the partition is removed to before? (iii) Suppose
the partition is replaced by a movable piston which slowly lets the gas expand
and provides an equal and opposite pressure until the volume becomes $3V_0$. (The
system is still in complete thermal isolation.) What is the change in entropy now?
(iv) What is the work done by the gas in terms of $V_0$ and the initial pressure $P_0$?
(v) What is the change in internal energy?


Exercise 11.32 Consider two Carnot engines in series: the first absorbs heat at
$T_1$ and rejects some at $T_0$, and the second absorbs all that rejected heat at $T_0$ and
rejects heat at $T_2$. (i) What is $\eta_{12}$, the efficiency of the combination in terms of
$T_1$, $T_0$, and $T_2$? (ii) Express $\eta_{12}$ in terms of $\eta_{10}$ and $\eta_{02}$.


Exercise 11.33 Carnot’s postulation of the second law is that you cannot devise
a process whose sole effect is to transfer some heat from a cold body to a hot body.
Consider a Carnot engine that absorbs 100J at $T_1$ and rejects 60J at $T_2$. Someone
claims to have a better engine which absorbs 100J at $T_1$ and rejects 20J at $T_2$. Show
how the two of them can be combined to violate Carnot’s postulate.


Exercise 11.34 Consider a mole of a gas initially at $1 = (P_1, V_1)$ and finally at
$2 = (P_2 < P_1, V_2 > V_1)$. Since $S_2 - S_1$ is path independent, choose the simple
path by first lowering pressure at constant volume and then increasing volume
at constant pressure. Let $0 = (P_0, V_0)$ be the intermediate point you go through.
(Thus $P_0 = P_2$ and $V_0 = V_1$.) Show that
$S_2 - S_1 = c_p \ln\left(\frac{T_2}{T_1}\right) - R \ln\left(\frac{P_2}{P_1}\right)$. Show that
if 1 and 2 lie on an adiabatic curve, this difference vanishes. Assume $c_p = c_v + R$
but not a particular value to $c_v$.


Exercise 11.35 I mix 10kg of Cu ($C_{Cu} = 386\,\mathrm{J/(kg \cdot K)}$, at 100°C) with 5L of
water ($C_w = 4180\,\mathrm{J/(kg \cdot K)}$, at 0°C). What is (i) the final $T$, (ii) $\Delta S_{Cu}$,
(iii) $\Delta S_w$, and (iv) $\Delta S_{universe}$?


Exercise 11.36 (A very thorough test of many topics and endurance.) Consider
2 moles of an ideal diatomic gas taken through the cycle ABCDA shown in Figure
E-7. You are given $T_A = 800$K, $P_A = 6 \times 10^5$Pa, $V_B = 3V_A$, $P_C = 10^5$Pa, AB is
isothermal, BC is adiabatic, CD is at constant pressure, and DA is at constant
volume. (i) Find $(P, V, T)$ for all the corners. (ii) Find the work done in each
part of the cycle and the total. (iii) Find the heat added in each part and (iv) the
efficiency $\eta$.


Figure E-7 Exercise 11.36 

Answers to Exercises


Some answers may differ slightly from yours due to rounding. 


PROBLEM SET 1, FOR CHAPTER 1 


Exercise 1.1 (i) 16m, (ii) 16%, (iii) t= $, amin = —1.54, (iv) a= 204. 
Exercise 1.2 (i) v= —5, (ii) 0%, (iii) No, (iv) x(t) = 5(t— 1). 

Exercise 1.3 (i) v(t) = 4+ 12t + 120’, a(t) = 12 + 24t, (ii) .5s, (iii) a= 03. 
Exercise 1.4 (i) Given. (ii) Only v= 1 meets the definition of the average. 
Exercise 1.5 4m. 

Exercise 1.6 10°s. 


Exercise 1.7 63 me

Exercise 1.8 (i) $\frac{2v_1v_2}{v_1+v_2}$, (ii) $\frac{v_1+v_2}{2}$. (iii) The difference
(ii) $-$ (i) is $\frac{(v_1-v_2)^2}{2(v_1+v_2)}$. The answer (i) is smaller because the
particle spends more time going slower. Visualize the extreme case $v_1 = 0$.

Exercise 1.9 $v = \frac{3v_1v_2v_3}{v_1v_2+v_2v_3+v_3v_1}$.

Exercise 1.10 Given.

Exercise 1.11 (i) 435 (ii) 18m.
Exercise 1.12 18.6m.
Exercise 1.13 (i) 105.1, (ii) 1.0s, (iii) 1.3s, (iv) −33%, (v) 6.65,55.2%.

Exercise 1.14 (i) $v = \sqrt{2gH}$, (ii) $v = \sqrt{2g(H+h)}$, (iii) [3s The simplicity of
(iii) suggests that the best way to arrive at it is to consider the motion from the
topmost point.

Exercise 1.15 a= (—83). 
Exercise 1.16 (i) 62.5m, (ii) 5s. 
Exercise 1.17 (i) 31.6%, (ii) 63.2s. 
Exercise 1.18 $v) T, 2500m. 


v a2 
Exercise 1.19 (i) a= #74, (ii) 0.964. 


Exercise 1.20 a= = 
Exercise 1.21 (i) After 1s, at x =5, (ii) amax = 2.13; te = 24s. 
Exercise 1.22 T = 2% v t2Hs—y2Hg 

7 : 


Exercise 1.23 t, = 4, h = 3 
Exercise 1.24 t=2,x=11,v,) = 16,47. = 11; t= 7,x = 166, vı = 46,7 = 51.
Exercise 1.25 (i) (w= 12, t = —4, ve = —20), (w= —4, t = 4, vc = 28). (ii) At these 
meetings x; (t) and x(t) are tangential. 


PROBLEM SET 2, FOR CHAPTER 2 


Exercise 2.1 (i) A + B = 8i − 2j, A − B = −2i + 10j, 2A + 3B = 21i − 10j, C =
−8i + 2j, (ii) $A = 5$, $\tan^{-1}\frac{4}{3}$.

Exercise 2.2 Given. 

Exercise 2.3 (i) r(2) = 28i + 24j, v(2) = 14i + 24j, a(2) = 13). (ii) 14i + 12j 2, 
12). 

Exercise 2.4 (i) 80s, (ii) 16s. 

Exercise 2.5 vgG = VaT + VTG = 18i — 2j. 

Exercise 2.6 127 degrees, 270s. 

Exercise 2.7 0.225. 


Exercise 2.8 a = “ZR ~ 00274 ~ g/(3600). 
Exercise 2.9 (i) a= < 5g gives Rmin = 567m, (ii) 6mg. 

Exercise 2.10 3.9 x 10° 4, 401m. 

Exercise 2.11 (i) 9.58s, (ii) x = 2396m, (iii) —0.376. 

Exercise 2.12 Given. 

Exercise 2.13 (i) x = moti (tang — tany), y = mot (tan? — tan? y), Z = 


5(tand + tan y). (ii) At the beginning, when both coordinates are infinitesimal, 


y/[x= aw = tan, which means tan y = tan; at the highest point y = 0 and the 


result agrees with Exercise 2.12; while when it lands (y = 0) its vertical velocity 


and slope should be reversed relative to the initial values, that is, tan@ = — tan y. 
2 
Exercise 2.14 (i) h= Rtan G sin! 2), (i) h= &, Gii) 78cm. 
0 


Exercise 2.15 (i) t = 3.1s, Ax = 62m, (ii) t* = 1.53s (using vy(t*) = 0), Ymax = 
y(t*) = 13.5m. 

Exercise 2.16 (i) vo = 40%, (ii) 3°. 

Exercise 2.17 28", ls. 

Exercise 2.18 1.5m ahead. 

Exercise 2.19 Given. 

Exercise 2.20 6 = tan”! 2h 

Exercise 2.21 (i) w= v cos — aca: (ii) No. 

Exercise 2.22 a = tand. 


: . 2v2 cos? O(tand—t wig de 
Exercise 2.23 (i) R= ee (ii) Given. 


Exercise 2.24 Given. 
Exercise 2.25 Given.
Exercise 2.26 (i) v) = vov 1+ 3 cos? 0, tan0' = + tand, 
er * _ R gR 
(ii) t* = R/G v cos), y* = 3 (tang ee Raz) 
Exercise 2.27 (i) vio = V1, V20 = Va, (ii) rio = Vit, r20 = Ri + vot. 


Exercise 2.28 Go to the freely falling frame in which gravity plays no role. Find the 
condition for the collision point to have a positive y coordinate in the lab frame. 


PROBLEM SET 3, FOR CHAPTERS 3 AND 4 


Exercise 3.1 (i) $a' = \frac{F}{m_1 + m_2 + m_s}$. (ii) Force on $m_1$ is $m_1 a'$ to the
right; on the string is $m_1 a'$ to the left, $(m_1 + m_s)a'$ to the right; force on $m_2$
is $(m_1 + m_s)a'$ to the left, $F = (m_1 + m_2 + m_s)a'$ to the right.
(iii) $a'' = \frac{F - (m_1 + m_2)\mu g}{m_1 + m_2 + m_s}$; force on $m_1$ is
$m_1(a'' + \mu g)$ to the right; on the string is $m_1(a'' + \mu g)$ to the left,
$m_1(a'' + \mu g) + m_s a''$ to the right; force on $m_2$ is $m_1(a'' + \mu g) + m_s a''$
to the left, $F = (m_1 + m_2 + m_s)a'' + (m_1 + m_2)\mu g$ to the right. You do the numbers.

i t2 ao 

Exercise 3.2 h = Zg (ao —g), T= a (1 +/1—(g/ao)). 
Exercise 3.3 Given. 


Exercise 3.4 (i) a= g(a) m > usM. (ii) Collision speed is v 2ad at time 


t= Ne where a is as given in part (i). 

Exercise 3.5 (i) a= $ to the right, (ii) tension on string connected to the 3kg is 
19.6N, and the other tension is 13N. 

Exercise 3.6 (i) The spring expands, pushing the mass out, and then loses contact
when the mass crosses x = 0 because the spring can only push but not pull.
(ii) Initially $x(t) = -A\cos\left(\sqrt{k/m}\,t\right)$. When $x(t) = 0$, the mass is no
longer in contact with the spring. This happens when the cosine vanishes: at
$t_0 = \frac{\pi}{2}\sqrt{m/k}$. The velocity at this instant is $v(t_0) = A\sqrt{k/m} = v$.
(iii) $x(t) = v(t - t_0)$. (iv) Using $v^2 - v_0^2 = 2ax$ and $a = -\mu g$, we get
$x = \frac{A^2 k}{2\mu m g}$.

Exercise 3.7 (i) The spring will not move since it exerts equal and opposite forces
on the masses, and they in turn exert equal and opposite forces on the spring. The
masses will have equal and opposite velocities and positions by symmetry. (ii) A
displacement $\delta$ from equilibrium of each mass translates into a distortion by $2\delta$
for the spring. So the force divided by $\delta$ will be $2k$. (iii) As long as the spring is
expanding and pushing the masses out, each will obey the solution of Eqn. 4.8 with
a suitable $A$ and force constant $2k$. The velocity follows from $v = \dot{x}$ and reaches a
peak $v = \pm A\sqrt{2k/m}$ after a quarter cycle of the oscillator, at time
$t = \frac{\pi}{2}\sqrt{m/(2k)}$. (iv) After that time the masses will have no contact with
the spring.
Exercise 3.8 T) = 22° — 49N, To = 28S S/T) = 85N.


sin(a+b) sin(a+b) 
; _ (m-m)g _ (m-m)g : 
Exercise 3.9 a, = maem Pe eee sie unless the masses are equal, 
one a is positive and the other negative. (ii) T = ee pointing up. 


Exercise 3.10 3.9s later. 


: _ (2m—4m)g — 3m mg 
Exercise 3.11 a; = amem, A = ane 
: : — 4mm +: _ (m — m)? 
Exercise 3.12 (i) me = mam? (ii) me — (mı + m) = =e SU: When 


one mass vanishes, the other falls freely under gravity and is not really supported 
by To. 

: : _ 4mm3—m,(m+m3) 7s: o _ „2m-—m : 
Exercise 3.13 (i) qi = 8 Imm +m (m4m) (ii) When m = M3, 4 = SIm +m > which 
is like a simple Atwood machine with mass 2m on the right. 


Exercise 3.14 Given. 

Exercise 3.15 (1) Amax = ger”) (ii) Amax = (utp) gt M) 

Exercise 3.16 When you use $v^2 = 2|a|d$ to find $a$ and from it $F$, the answer is
$1.5 \times 10^4$N.

Exercise 3.17 (i) 1.32m, (ii) 1.27m, (iii) 1.2m, (iv) Amax = 40.25. 


Exercise 3.18 9 = tan! ; toward the back of the bus. 


Exercise 3.19 (i) $k_e = k_1 + k_2$, (ii) $k_e = \frac{k_1 k_2}{k_1 + k_2}$.

Exercise 3.20 us = Cae, 

d 242 
Exercise 3.21 (i) 0 = tan! [= |, (ii) 13.9m. 
Exercise 3.22 (i) zait, (ii —6mg. 
Exercise 3.23 SY = — i = —0.0035. 
Exercise 3.24 (i) F, = m(gsin@ — 8), Fy = 0, (ii) vo < 2./Rg. (iii) At the bottom, 
p2=4, 

2 


Exercise 3.25 (i) Mg = T= m f= $y Me, (ii) The time period is 1/f. 
Exercise 3.26 f = 3- |R 


Exercise 3.27 (i) v(t) = q Pmr (ii) 00) = z n [1 + £R". 

Exercise 3.28 Given. 

Exercise 3.29 Vmax = /gR = 14”, Fy = 2mg = 1.96 x 10*N. 

Exercise 3.30 Fe = Siang = 342N. Weight is 216N. 

Exercise 3.31 3mg downward. 

Exercise 3.32 (i) M will slide downhill from rest if $m < M(\sin\alpha - \mu_s\cos\alpha)$,
while (ii) for $m > M(\sin\alpha + \mu_s\cos\alpha)$ the mass M will go uphill. If
$m < M(\sin\alpha - \mu_k\cos\alpha)$, and M is given a tiny push downhill, it will keep
going down, while if $m > M(\sin\alpha + \mu_k\cos\alpha)$, and M is given a tiny push
uphill, it will keep going up.

PROBLEM SET 4, FOR CHAPTERS 5, 6, AND 7


Exercise 4.1 W = 2 Ax9/?, 

Exercise 4.2 (i) 2, (ii) 1, (iii) 1. 

Exercise 4.3 (i) 1, (ii) 38. 

Exercise 4.4 (i) No, (ii) Yes, U = —x’y’, (iii) Yes, U = — cosx + sin y, (iv) Yes, 
U = — sinh xcosh y. 

Exercise 4.5 2180 days. 

Exercise 4.6 (i) 5.17%, (ii) 6.4m. 

Exercise 4.7 (i) 5R, (ii) v = y/5gR, (iii) T = 6mg, (iv) 3%, (v) A= SMR 2.9m. 
Exercise 4.8 v? = 2gh(1 — ug cot 5 

Exercise 4.9 Given. 

Exercise 4.10 At a distance $d = 2\sqrt{h(H - h)}$, independent of g.

Exercise 4.11 18d. Notice the answer is independent of m, k, and g! 

Exercise 4.12 (i) mg, (ii) N = mg, (iii) ag = g, ar = Beg, 


Exercise 4.13 (i) 9 = cos! 4, (ii) 9 = cos! (3 + i). 


Exercise 4.14 (i) x = 0, (ii) x=+,/ 41, (ii) œ = ,/ “4. 
Exercise 4.15 (i) x = n,an integer, (ii) œ = 27,/ A 
Exercise 4.16 (i) 3, (ii) 3, and (iii) 3. (iv) U(x, y) = ——4-, (v) 5 and 55. 


Exercise 4.17 v = 5gR. 

Exercise 4.18 9.773. 

Exercise 4.19 $6 \times 10^{15}$s $\simeq$ 200 million years.
Exercise 4.20 1.1 x 10°. 

Exercise 4.21 42,000km.

Exercise 4.22 $v = \sqrt{2GM_E/r}$.

Exercise 4.23 Given. 

Exercise 4.24 7.954”, 


PROBLEM SET 5, FOR CHAPTER 8 


Exercise 5.1 (i) At the origin (0,0), (ii) (—4, 4), (iii) (— 2,0). 
Exercise 5.2 4 above the base. 
Exercise 5.3 (i) (3.2cm,0), (ii) (0,0). 


Exercise 5.4 Given. 


. . as m3 oe — (m +m )L aes = m . 
Exercise 5.5 (i) V = mam (ii) t= ACES TESTE (iii) d = aS L, (iv) 
— m 
Xem = m+m+m 


Exercise 5.6 (i) a = 37.6°, (ii) v = — p vb cosa = —0.66 7. 




Exercise 5.7 v4 = =a) vg = xve = x, 

Exercise 5.8 vı = —v ti, V2 = VY (4) (Bi+ 1) V3 = Vo (4) (Bi- 1). 
No friction means all forces are normal to the spheres. 

Exercise 5.9 H = h (24M)? , H= 10.1mm. 

Exercise 5.10 Given. 

Exercise 5.11 (i) vo #, (ii) 4vm( — m/M). 

Exercise 5.12 (i) 20%, (ii) 100%. 

Exercise 5.13 (i) 0.4%, (ii) 180J ~ initial energy of the bullet (neglecting the tiny 
kinetic energy of the two masses). 

Exercise 5.14 (i) 1.24 x 10”, (ii) 2.29 x 107}°J. 

Exercise 5.15 (i) v’ = 6.2 (icos23.8° + jsin23.8°)™, (ii) no. 

Exercise 5.16 (i) —2km, (ii) 504%, (iii) 50M, (iv) 5.8 x 1057. 


Exercise 5.17 (i) V = ere S Gi) v™ = miin) = —(v)™, 1 < 2 for m. 


Note that the speed of each particle is the same before and after the collision. 


(iii) The total momentum, which is the CM momentum, is zero in the CM frame. 

Exercise 5.18 v4 = -4 vg = am 

Exercise 5.19 (i) 0, (ii) 3, (iii) 4. 

Exercise 5.20 a = Bit Vj. 

Exercise 5.21 0.31s. 

Exercise 5.22 28,479kg. 

Exercise 5.23 2,197. 

Exercise 5.24 x = e — 1 = 1.718... 
In3 


Exercise 5.25 vr = Yhn?’ 
2 


PROBLEM SET 6, FOR CHAPTERS 9, 10, AND 11 


Exercise 6.1 (i) 0.104, (ii) 665, (iii) a, = —6.984. 

Exercise 6.2 $\frac{1}{2}MR^2$. (Think of it as a stack of discs of radius R.)

Exercise 6.3 (i) ima, (ii) ima, (iii) ma’. 

Exercise 6.4 (i) Icorner = 3Ma’, (ii) Icm = 4 MP. 

Exercise 6.5 (i) ml, (ii) t = ™®, (iii) N= Z2, (iv) 18.8kg- m2, 12.5s, 2,000. 
Exercise 6.6 Given. 

Exercise 6.7 (i) Icy = 2MR’, (ii) Tedge = EMR. As p (r) increases with r, the mass 
is distributed farther out from the center than in the uniform case. Adding MR? to 
move the axis to the circumference only increases I, which is always a minimum 
with respect to the CM. 

Exercise 6.8 Given. 

Exercise 6.9 $\mathbf{A} \times \mathbf{B} = \mathbf{i}(A_yB_z - A_zB_y) + \mathbf{j}(A_zB_x - A_xB_z) + \mathbf{k}(A_xB_y - A_yB_x)$.
Exercise 6.10 (i) $|\mathbf{A} \times \mathbf{B}|$ is the area of the parallelogram with sides A and B.
(ii) $(\mathbf{A} \times \mathbf{B}) \cdot \mathbf{C}$ is the volume of a parallelepiped with edges parallel to
A, B, and C.
Exercise 6.11 Given. 

Exercise 6.12 (i) L= —7k, L=7, (ii) sind = —£. 

Exercise 6.13 (i) imgR, (ii) 5mgR. Notice that r drops out, (iii) R> R—r. 
Exercise 6.14 (i) yg = “SCM me Z 0.31, (ii) y = EHAO = 288. 


(We have assumed 1 > zz. cot, which is true here.) 


Exercise 6.15 u, > 2tand. 
Exercise 6.16 (i) $g (ii) img. 


@0 (ii) 1 
eG (MRE) ? L PON (MR) 
Exercise 6.18 I; ~ 108kg - m? = 10°8kg - m? , Tf = 10745; 


Exercise 6.19 (i) a= 9 rad, (ii) W = 54J. 
Exercise 6.20 l = R ae = 


Exercise 6.21 (i) a = Taine’ (ii) @ (5) = Tin’ (iii) Friction is 
along the rod and does not contribute to torque. 

Exercise 6.22 w = oe K= EMR og 

Exercise 6.23 H = R. 


Exercise 6.24 Given. 


Exercise 6.17 (i) œ = 


= 2.3m. 


. sy 2 2) 2R see: 202 R? 
Exercise 6.25 (i) $00, 3 Tg (iii) d= Bi 
Exercise 6.26 (i) (1 + F 1 Gi) h= o 


c= 2¢(m+M)(1+(4M/3m)) * 
Exercise 6.27 vo = ,/ ee where I = mii + “e is the moment of inertia of 
the rod and putty. 


nae 


p ` mRv 2 R(Ip-+mR2 
Exercise 6.28 (i) œ = hime , (ii) b sol , (iii) T = ree 
Exercise 6.29 a= L0"), 

m+m+zM 


Exercise 6.30 (i) a = 2.35, (ii) T2 = 24.2N, T4 = 30N, (iii) 4.3%. 

Exercise 6.31 (i) F = 346iN, (ii) Fp = (—346i+ 400j) Newtons. 

Exercise 6.32 (i) Fp = (—400i+ 200j) Newtons, (ii) Fp = (—500i+ 300j) Newtons. 
Exercise 6.33 u, = 45, 


Exercise 6.34 (i) T = Cnt 2M — = 672N, F=i(M+4m) gcotd +j% = 609i + 


39.2j Newtons. Notice that F, is independent of M! (ii) œ = £ ( 274). 


Exercise 6.35 (i) T = 1L Rogo = 1250/3N, (ii) F = iT cost) + 
j(—T sin0, + (m+ M)g) = 625(./3i + 1125)). 
Exercise 6.36 (i) Get a kid > weight 70N and place her at the right end, (ii) 1m 


right of the midpoint, (iii) $ =< m from the 100N kid. 
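
The component formula of Exercise 6.9 and the geometric statements of Exercise
6.10 can be spot-checked numerically. What follows is a minimal sketch, not part
of the original answers: the helper names `cross` and `dot` and the three sample
vectors are chosen here purely for illustration.

```python
# Spot-check of the cross-product component formula (Exercise 6.9) and the
# triple product as a volume (Exercise 6.10). Helper names and sample vectors
# are illustrative assumptions, not taken from the text.

def cross(a, b):
    """Return A x B using the i, j, k component formula."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)

def dot(a, b):
    """Return the ordinary dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))

A = (1.0, 2.0, 3.0)
B = (4.0, 5.0, 6.0)
C = (7.0, 8.0, 10.0)

AxB = cross(A, B)

# A x B must be perpendicular to both A and B.
print("A x B =", AxB)
print("(A x B).A =", dot(AxB, A), " (A x B).B =", dot(AxB, B))

# Swapping the factors flips the sign: B x A = -(A x B).
print("B x A =", cross(B, A))

# |(A x B).C| is the volume of the parallelepiped with edges A, B, C.
print("volume =", abs(dot(AxB, C)))
```

The perpendicularity and antisymmetry checks hold for any choice of vectors, so
they make a quick test of whether the components were paired correctly.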




PROBLEM SET 7, FOR CHAPTERS 12, 13, 14, AND 15 


Exercise 7.1 In one second light travels a distance of 1 light-second by definition.
Dividing this distance by 1s, we get, in these units, velocity $c = 1$. A light-
second is $3 \times 10^8$m.

Exercise 7.2 (i) −7.73 min, that is, B occurred before A, (ii) 14.4min.

Exercise 7.3 (i) 2.6 x 10°, (ii) 9.7min, (iii) (As)? = 25(light-min)* = 9 x 
104 (light-seconds)*. 

Exercise 7.4 (i) 95.8km, (ii) 594m, (iii) 2 x 10~®s. In fact all lifetimes are proper 
times, that is, given in the particle’s rest frame. 

Exercise 7.5 Its lifetime in the earth frame is $1.56 \times 10^{-4}$s, and the time to travel
30km at (almost) the speed of light is $10^{-4}$s.

Exercise 7.6 * = .999456 

Exercise 7.7 (i) 0.6m, (ii) 2.5ns, (iii) 1m/.8c = 4.17ns. 

Exercise 7.8 (i) 80m, (ii) 444 x 107s (iii) 5.55 x 1077s, 
(iv) 3.33 x 1077s, (v) 1.66 x 1077s, (vi) 17,983 m°. 

Exercise 7.9 (i) Since $\sqrt{1 - 0.8^2} = 0.6$, the Lorentz contracted length of the train is
$L_0 \times 0.6 = 200 \times 0.6 = 120$m $= L$, which is the length of the tunnel.
(ii) FF: $(x, t) = (120\,\mathrm{m}, 0.500\,\mu\mathrm{s})$, $(x', t') = (0, 0.300\,\mu\mathrm{s})$,
BB: $(x, t) = (0\,\mathrm{m}, 0.500\,\mu\mathrm{s})$, $(x', t') = (-200\,\mathrm{m}, 0.833\,\mu\mathrm{s})$.
(iii) $-1.44 \times 10^4\,\mathrm{m^2}$, easiest to compute in the tunnel frame
where only $\Delta x \neq 0$.

Exercise 7.10 (i) x= light - seconds, (ii) tọ = 2s. 

Exercise 7.11 (i) “ = 0.4. (ii) Yes, because there is not enough time for a light signal 
to go from the earlier to the later event (cAt = 0.4A x). (iii) s* = —7.56 x 10°m?. 
(iv) There is no such observer: if a light pulse cannot make it to both of the events, 
no one can. 

Exercise 7.12 (i) 8yrs, (ii) $4\sqrt{3}$yrs, (iii) $2\sqrt{3}$lyrs, (iv) ($x$ = 4lyrs, $t$ = 8yrs),
(v) ($x' = 0$, $t' = 4\sqrt{3}$yrs). The rocket is at the origin of its own coordinate
system all the time.

Exercise 7.13 8.6yrs. 

Exercise 7.14 (i) —1.49 x 10°, (ii) Red occurs 4.25 us earlier. 

Exercise 7.15 $P = mc(\cosh\theta, \sinh\theta)$.

Exercise 7.16 (i) ie (ii) 0.5493, 2 x 0.5493, 3 x 0.5493, (iii) cosh(3 x 0.5493) = 
2.6943. 

Exercise 7.17 (i) Given. (ii) $M = M(\theta') \cdot M(\theta) = M(\theta + \theta')$, where

$$M(\theta) = \begin{pmatrix} \cosh\theta & -\sinh\theta \\ -\sinh\theta & \cosh\theta \end{pmatrix}$$
Exercise 7.18 8 (t) = sinh! H,
Exercise 7.19 $\sqrt{17}\,m_p c^2 = 3.867$ GeV.
Exercise 7.20 0 = 8.21. 
Exercise 7.21 (i) 1.6c, (ii) because the limit on relative speed of two objects in any 
frame is 2c, (iii) 0.9756c. 
Exercise 7.22 21. 
py? 2 


: (m—m*)*c 
Exercise 7.23 aa 


Exercise 7.24 M = ‘esa +y),p=movy. 
Exercise 7.25 V =v; T . The second route is a lot easier. 


. = cal 
Exercise 7.26 * = F 


Exercise 7.27 Given. 


Exercise 7.28 Given. 

Exercise 7.29 (i) Ez = ¿mge = 1.784, °, Pr = 1.476mz c, (ii) Or = 1.182. 
Exercise 7.30 (i) 0 = 2.634, (ii) 0 = 1.317, (iii) Ox = 2.634 in any frame. Chang- 
ing frames only adds a constant to each rapidity, leaving the difference unaffected. 


(This is true of low velocities.) 
Exercise 7.31 M = 10.72 GeV. 
Exercise 7.32 w = 67.5MeV,k = +67.5MeV / c. 


PROBLEM SET 8, FOR CHAPTERS 16 AND 17 


Exercise 8.1 The phase of z will be 4 because i has a phase of +, and zı with equal 
real and imaginary parts has a sage of 4. Since zï has the phase opposite to ea 
of zı, dividing by it will advance the phase of iby 7 to = = Z, Finally, n =|z|= -5 


J 
Exercise 8.2 (i) z = $ — Ši, |z| = z, f= 26 + 41), z = HE + 4i), 
(ii) z = —7 + 24i, |z| = 25, z* = (3 — 4i? = -7 — 24i, z! = — (7 + 
24i), (iii) z= -4 + $i |z| = 1, Z = —4 (7+ 24i), z7! = p 


(iv) z= ES + Ai, a = G, 2 = (0-9-43). z 
j (0- V9- i143). 

izja ze ~in/4 _ Li, |z|=1, g =2e/6 3 — i, a= 
2e'*/6 = /3 +i, |z| =2, (ii) zz = 2e7/, 2 soon, 


2 


-1 ¥243V/3 , 
Exercise 8.4 zı +z = ,/40+ a (1+4/3) etn e = 7.95 ¢0-982i, 


Exercise 8.5 cos46 = cos*@ — 6 cos? 0 sin? 0 + sint 4, sin 40 = 4cos@ sin (cos? 8 — 
sin? 0). 

Exercise 8.6 Given. 

Exercise 8.7 fi (x) = x+ 3x? + 6x +.. h(x) = x+ Ea + Ze +... 

Exercise 8.8 (i) F(x) = a + a +...; (ii) because f(x) is even while x° is odd. 


Exercise 8.3 (i) z =e 
Exercise 8.9 (i) wlll +x—5x?+...), (ii) 1+ x4 42° +...), (iii) n2+ ix
gx E kes 

Exercise 8.10 (i) T = 27 Jia +£...) (ii) 1.7%. 

Exercise 8.11 @ = / 20. 

Exercise 8.12 (i) After 4s. (ii) The slow one has completed one cycle, and the fast 
one four. 

Exercise 8.13 (i) 10s, (ii) 5 and 2 cycles. (iii) Since the period is independent of 
the amplitude, the answers do not change. 

Exercise 8.14 (i) ô = 3.362 rad, lies in the third quadrant where cosd < 0, 
(ii) f = 1.02Hz, (iii) k= 109%, (iv) m= 2.7kg. 

Exercise 8.15 The mass oscillates as without gravity but about the point xp where 
the force of gravity balances the spring force. 

Exercise 8.16 (i) It will oscillate with A = 2cm. (ii) The net force will be zero when 
it drops by 2cm, at which point kx = mg. (iii) T = 0.28s. 

Exercise 8.17 A = nce ; 

Exercise 8.18 0.67kg, 9.5 x 104%. 

Exercise 8.19 When x = +A, 4, > aoe 


Exercise 8.20 Given. 


LMT 
‘ _ ef fl m 
Exercise 8.21 0.9 = aj Ty DMA m leads to m = 0.039. 


Exercise 8.22 (i) T = 2.8, (ii) œ = 1™, (iii) Omax = 26°. 


Exercise 8.23 f = st EST 


Exercise 8.24 Given. 


Exercise 8.25 Given. 

Exercise 8.26 It is quadratic in 0, and we drop all non-linear terms in the small 
angle approximation. 

Exercise 8.27 Given. 

Exercise 8.28 Given. 

Exercise 8.29 v = 157. 

Exercise 8.30 (i) 0.05m, (ii) œo = 14.1%, (iii) 0.35m. 

Exercise 8.31 (i) The minimum of $|Z|$ and $|Z|^2$ occur at the same $\omega$. (ii) The
velocity has the maximum amplitude at $\omega = \omega_0$.

Exercise 8.32 T* = 222. 


Wd 
-11 
cos(2t—tan 1t) 


‘ _ 4 
Exercise 8.33 xp(t) = ïg + TE : 
Exercise 8.34 (i) $x(t) = 2.02e^{-t}\cos(7.94t - 0.125)$, (ii) $x_p(t) = 0.106\cos(16t - 3.31)$,
(iii) $x(0) = -0.102$, $v(0) = 0.271$.

PROBLEM SET 9, FOR CHAPTERS 18 AND 19


Exercise 9.1 17m, 1.7cm. 

Exercise 9.2 (i) w(x, t) = (0.4) cos(¥Æx — 347 t), (ii) u = 207 A?f? = 18.34, 
(iii) (P) = u- v= 109.5W. 

Exercise 9.3 2136m. 

Exercise 9.4 2”. 

Exercise 9.5 (i) A = 0.5, (ii) f = 30Hz, (iii) Vmax = 307, (iv) amax = 180077. 
Exercise 9.6 12.65. 

Exercise 9.7 (i) 1.364, (ii) 68.2W. 

Exercise 9.8 $v = \sqrt{gL}$.

Exercise 9.9 t = E = Ta = ig = 0.09s. 

Exercise 9.10 W, /W2 = 4. 

Exercise 9.11 fo = 2fp. 


Exercise 9.12 (i) t= Tan = Vm (ii) t= 2(,/L(m + M)/mg — ,/ML/mg). 
(If m= 0, the mass per unit length will vanish, as will t.) 

Exercise 9.13 m= me = 19.6kg, M = =" = 27.7kg. 

Exercise 9.14 $10^{0.1} = 1.26$.

Exercise 9.15 The nearer point is at r= cesT 
Exercise 9.16 (i) P=0.13W, (ii) r= 100m. 
Exercise 9.17 Using fhigh/low = Sar ie 600 + 1 gives A = 2.2cm. 
Exercise 9.18 568 Hz — 635Hz. 

Exercise 9.19 (i) Vm = Vs X $ = 0.83", (ii) A = 0.53m. 

Exercise 9.20 We want a node at the eardrum and an antinode at the opening. So
the canal is a quarter wavelength and $f = 330\,\mathrm{m/s} / (4 \times 0.03) = 2750$ Hz.

Exercise 9.21 (i) 431.3 Hz, 1294Hz, (ii) 862.5Hz, 1725Hz. 

Exercise 9.22 8 antinodes. 

Exercise 9.23 (i) and (ii) 825Hz, (iii) 413 Hz. 

Exercise 9.24 (i) $1.1 \times 10^3$ m/s, (ii) $L = 3.42$m, (iii) $\lambda = 1.71$m, (iv) $x =
.04 \sin(3.67x) \cos(4020t)$.

Exercise 9.25 (i) The modes must have $\lambda = 2L, 2L/2, 2L/3, \ldots$ or
$f = f_0, 2f_0, 3f_0, \ldots$. So $2f_0 = 500$Hz. $T = (2f_0)^2 L^2 \mu = 62.5$N. (ii) In
that case, $f_0 = 125$Hz and the resonant frequencies would have been 125, 250, 375,
500, 625, 750, 875Hz and the fifth harmonic at 625Hz should have been picked up.


2 2 
Exercise 9.26 (i) @ = sin™! | “4 i], n=1,2,..., (ii) n= 3,0 =35.5°, n= 4,0 = 


n? mg 
19.1°, n = 5,0 = 12.1°. The integer n counts the number of half-wavelengths. 


Exercise 9.27 (i) 318.5%, (ii) 323.6%. 


= 3.42m. 
Exercise 9.28 (i) maxima at 0.0cm, ±25cm, ±50cm, ±75cm, ±1m from the midpoint,
(ii) minima at ±12.5cm, ±37.5cm, ±62.5cm, ±87.5cm from the midpoint.
Exercise 9.29 (i) 1.81m, (ii) ±51.4°, (iii) path difference cannot exceed d,
(iv) ±(11.3°, 35.9°, 77.8°).

Exercise 9.30 (i) 20 sin(.52 x) cos(40z t), (ii) 2m, (iii) v= —2.51 x 10° sin(40z t) aa 
a= —3.16 x 10° cos(40z t) 4 


2° 


Exercise 9.31 Make the hole at (100, 32). 
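
Two of the numerical answers in Problem Set 9 are short enough to re-derive in a
few lines: the quarter-wavelength resonance of Exercise 9.20 and the intensity
ratio $10^{0.1}$ of Exercise 9.14. The sketch below is illustrative. The function
names are invented here, and reading Exercise 9.14 as a 1 decibel difference is
an assumption based only on the form of the quoted answer.

```python
# Illustrative re-derivation of two Problem Set 9 numbers.
# Function names are made up; the 1 dB reading of Exercise 9.14 is an assumption.

def quarter_wave_frequency(speed, length):
    """Lowest resonance of a tube closed at one end: the tube is a quarter wavelength."""
    wavelength = 4.0 * length
    return speed / wavelength

def intensity_ratio(delta_db):
    """Ratio of two intensities whose levels differ by delta_db decibels."""
    return 10.0 ** (delta_db / 10.0)

# Ear canal of length 0.03 m with sound speed 330 m/s (Exercise 9.20).
print(f"f = {quarter_wave_frequency(330.0, 0.03):.0f} Hz")

# A 1 dB change in level corresponds to this ratio of intensities.
print(f"I2/I1 = {intensity_ratio(1.0):.2f}")

# For comparison, 10 dB is exactly a factor of ten in intensity.
print(f"10 dB -> {intensity_ratio(10.0):.1f}")
```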


PROBLEM SET 10, FOR CHAPTER 20 


Exercise 10.1 455 Pa. 

Exercise 10.2 1.2 x 104 Pa. 

Exercise 10.3 (i) z= 5h, (ii) z= $f, (iii) Rmax = h. 
Exercise 10.4 (i) Given. (ii) ay = /2gdi wò. 
Exercise 10.5 (i) d= ae fae. 

Exercise 10.6 3.5. 


: PON m 
Exercise 10.7 T = 27 | pgĀ' 


Exercise 10.8 v, A; = ae =8.1 x io, 
2 1 


Exercise 10.9 9.3 x 104N. 

Exercise 10.10 196kN. 

Exercise 10.11 (i) 229.3N, (ii) 223. 
Exercise 10.12 $A = 9.8 \times 10^{-3}\,\mathrm{m^2}$.
Exercise 10.13 Given. 

Exercise 10.14 (i) 4,900N, (ii) 4cm, (iii) 1.
Exercise 10.15 v= 146%. 

Exercise 10.16 (i) C= Z, (ii) y= a 


2 
Exercise 10.17 d = 2mm. 


Exercise 10.18 (i) P= pgh = 1.2 x 10° Pascals, (ii) F = PA = 9.2 x 10*N, which 
is roughly like a 10, 000kg person standing on the lid. 

Exercise 10.19 py/p, = 4. 

Exercise 10.20 $V = 94\,\mathrm{m^3}$.


PROBLEM SET 11, FOR CHAPTERS 21, 22, 23, AND 24 


Exercise 11.1 3,030kK/J. : 

`: —l_mM_⁄) _7o0 
Exercise 11.2 AT = 5 Game e = 7°C. 
Exercise 11.3 75g ice and 625g of water at 0° C. 


Exercise 11.4 Water at 4.7° C. 
Exercise 11.5 0.015.

Exercise 11.6 1.26kg of water and 0.14kg of ice, all at 0°C. 

Exercise 11.7 cy ~ 21807x- 

Exercise 11.8 To = Sikt = 60°. Thus Ty — Tr = 60, Tr — Tọ = 20: the blocks 
share the “drop” Tp — Ty in the inverse ratio of their thermal conductivities, just 
like resistors in series share the voltage drop. In this analogy $\kappa$ is like conductance
and $1/\kappa$ is like resistance. In electrical circuits the current is a constant, and here
the heat flux $\frac{dQ}{dt}$ is.

Exercise 11.9 1.1 x 10°J. 

Exercise 11.10 $\frac{dQ}{dt} = -101$W.

Exercise 11.11 8,500W. 

Exercise 11.12 Given. 

Exercise 11.13 2,030 moles. 

Exercise 11.14 $V_f = 1.4 \times 10^{-4}\,\mathrm{m^3}$.

Exercise 11.15 $V \simeq 0.022\,\mathrm{m^3} = 22$L.

Exercise 11.16 (i) $T = 2.44 \times 10^5$K, (ii) $5.04 \times 10^{-18}$J, (iii) $v$ = 14,700 m/s.
Exercise 11.17 (i) $W$ = 400J, (ii) $U_C = 2 \cdot 450$J = 900J, (iii) $\Delta Q$ = 850J,
(iv) and (v) 1:2.

Exercise 11.18 (i) $\eta = (T_h - T_c)/\left(T_h + \frac{3(T_h - T_c)}{2\ln(V_2/V_1)}\right)$, (ii) Given.

Exercise 11.19 $\eta = 1 - r^{1-\gamma}$.

Exercise 11.20 $\eta = \frac{W}{Q_{in}} = \frac{9}{49.5} = 0.182$.

Exercise 11.21 (i) and (ii) 555J. 

Exercise 11.22 387K. 

Exercise 11.23 (i) 2.4%, (ii) 6.7°. 

Exercise 11.24 (i) 483 m/s. (ii) 4 × 483 = 1,932 m/s. (iii) The H$_2$ molecule is about
six times as fast as sound.

Exercise 11.25 (i) $P(v) = \sqrt{\frac{2m^3}{\pi k^3 T^3}}\, v^2 e^{-mv^2/2kT}$, (ii) 1520.

Exercise 11.26 2.1 x 10°Pa. 

Exercise 11.27 (i) $P_C$ = 250kPa, (ii) $W_{A\to B}$ = −1,000J, $W_{B\to C}$ = 0, $W_{C\to A}$ =
2,012J, (iii) $\Delta Q$ = total work done = 1,012J.

Exercise 11.28 AC is adiabatic and AB is isothermal (less steep). (i) Point A:
(50kPa, 25L, 150K), Point B: (250kPa, 5L, 150K), Point C: (427.5kPa, 5L, 257K).
(ii) $W_{A\to B}$ = −2,007J, $W_{B\to C}$ = 0, $W_{C\to A}$ = 2,655J. $W_{ABCA}$ = 648J =
heat absorbed.

Exercise 11.29 $\Delta S$ = 0.461 cal/K.

Exercise 11.30 $\Delta S$ = 3.2 cal/K.


Exercise 11.31 (i) $\Delta S = R\ln 3$, (ii) $3^{N_A} \simeq 10^{10^{23}}$, (iii) $\Delta S = 0$ because
$\Delta Q = 0$ and the process was adiabatic and reversible,
(iv) $W = \frac{P_0 V_0\left(1 - 3^{1-\gamma}\right)}{\gamma - 1}$, (v) $\Delta U = -W$.

Exercise 11.32 (i) $\eta_{12} = 1 - \frac{T_2}{T_1}$ as if the middle reservoir was not there,
(ii) $\eta_{12} = \eta_{10} + (1 - \eta_{10})\eta_{02}$.

Exercise 11.33 Reverse the Carnot and double its size. It now runs as a refrigera-
tor, takes 120J from $T_2$, and with 80J of work input delivers 200J to $T_1$. This 80J
of work comes from the so-called “better-than-Carnot” engine. The sole effect of
the combination is to transfer 100J from $T_2$ to $T_1$.

Exercise 11.34 Given. 

Exercise 11.35 (i) $T$ = 288.6K, (ii) $\Delta S_{Cu}$ = −990 J/K, (iii) $\Delta S_w$ = 1,161 J/K,
(iv) $\Delta S_{universe}$ = 171 J/K.

Exercise 11.36 (i) A: ($6 \times 10^5$Pa, $0.0221\,\mathrm{m^3}$, 800K), B: ($2 \times 10^5$Pa,
$0.0664\,\mathrm{m^3}$, 800K), C: ($10^5$Pa, $0.1089\,\mathrm{m^3}$, 656K), D: ($10^5$Pa,
$0.0221\,\mathrm{m^3}$, 133K), (ii) $W_{AB}$ = 14,590J, $W_{BC}$ = 5,965J, $W_{CD}$ = −8,681J,
$W_{DA}$ = 0; $W$ = 11,873J, (iii) $Q_{AB}$ = 14,590J, $Q_{BC}$ = 0, $Q_{CD}$ = −30,382J,
$Q_{DA}$ = 27,667J, $Q$ = 11,873J = $W$, so that $\Delta U = 0$, (iv) $\eta = 0.280$.
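
Several Problem Set 11 answers can be reproduced directly from the ideal gas law
and the hint given in Exercise 11.20. The sketch below is one way to do that and
is not the author's worked solution. It assumes 1 atmosphere = $1.013 \times 10^5$Pa
(a value not listed in the table of constants), and the function names are
invented here. For the rectangular loop it uses $c_v/R = 1/(\gamma - 1)$ and
$c_p = c_v + R$, so the heat absorbed is $(c_v/R)V\Delta P$ on the constant-volume
leg and $(c_p/R)P\Delta V$ on the constant-pressure leg.

```python
import math

# Illustrative checks of Exercises 11.13, 11.15, 11.16 and 11.20.
R = 8.31        # gas constant, J/(K mole)
K_B = 1.38e-23  # Boltzmann's constant, J/K
ATM = 1.013e5   # one atmosphere in Pa (assumed value, not from the table)

def moles(pressure, volume, temperature):
    """Number of moles from the ideal gas law PV = nRT."""
    return pressure * volume / (R * temperature)

def rectangle_efficiency(p_low, p_high, v_low, v_high, gamma):
    """Efficiency of a rectangular P-V loop run so that it does positive work."""
    cv_over_r = 1.0 / (gamma - 1.0)   # follows from cp = cv + R
    cp_over_r = cv_over_r + 1.0
    work = (p_high - p_low) * (v_high - v_low)
    # Heat enters while P rises at v_low and while V grows at p_high.
    heat_in = (cv_over_r * v_low * (p_high - p_low)
               + cp_over_r * p_high * (v_high - v_low))
    return work / heat_in

n_room = moles(ATM, 50.0, 300.0)                  # Exercise 11.13
v_stp = R * 273.0 / ATM                           # Exercise 11.15, one mole
t_gas = 2 * ATM * 10.0 / R                        # Exercise 11.16 (i), one mole
u_molecule = 1.5 * K_B * t_gas                    # Exercise 11.16 (ii)
v_typical = math.sqrt(2 * u_molecule / 4.65e-26)  # Exercise 11.16 (iii)
eta = rectangle_efficiency(1.0, 4.0, 1.0, 4.0, 1.4)  # Exercise 11.20

print(f"moles in room      = {n_room:.0f}")
print(f"molar volume at STP = {v_stp:.4f} m^3")
print(f"T = {t_gas:.3g} K, U = {u_molecule:.3g} J, v = {v_typical:.3g} m/s")
print(f"rectangle efficiency = {eta:.3f}")
```

Only ratios of pressures and volumes enter the efficiency, so the corners can be
passed in Pa and L without converting units, and the value of $T_1$ is not needed.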


Constants and Other Data 


$c = 3 \times 10^8$ m/s, speed of light
$M_E = 5.97 \times 10^{24}$kg, mass of the earth
$M_S = 1.99 \times 10^{30}$kg, mass of the sun
$M_m = 7.35 \times 10^{22}$kg, mass of the moon
$R_E = 6.37 \times 10^6$m, mean radius of the earth
$G = 6.67 \times 10^{-11}\,\mathrm{N \cdot m^2/kg^2}$, gravitational constant
$g = 9.81\,\mathrm{m/s^2}$, acceleration due to gravity
$m_e = 9.11 \times 10^{-31}$kg, electron mass
$m_p = 1.67 \times 10^{-27}$kg, proton mass
$k = 1.38 \times 10^{-23}$ J/K, Boltzmann’s constant
$R = 8.31\,\mathrm{J/(K \cdot mole)}$, gas constant
1 calorie = 4.18J
$N_A = 6.02 \times 10^{23}\,\mathrm{mol^{-1}}$, Avogadro’s number
k 
Pa= io density of air 
m 
$\rho_w = 1{,}000\,\mathrm{kg/m^3}$, density of water


k 
Lp = 3a, latent heat of melting of ice 
& 


$L_{vap} = 2{,}257\,\mathrm{kJ/kg}$, latent heat of vaporization of water


$C_w = 4{,}184\,\mathrm{J/(kg \cdot K)}$, specific heat of water




Index 


$\gamma = C_p/C_v$, 392


absolute future, 229 
absolute past, 229 

absolute zero, 360 
acceleration, 6 

adiabatic processes, 396 
amplitude, 54 

angular momentum, 150 
angular velocity, 30, 146 
anti-nodes, 332 
Archimedes’ principle, 343 
average acceleration, 5 
average energy density of wave, 318 
average velocity, 5 
Avogadro’s number, 378 
axes, rotations of, 22 


ballerina, 173 

ballistic pendulum, 138 
barometer, 339 

basis, 20 

basis vectors, 22 

beat, 325 

Bernoulli’s equation, 346, 348 
Bevatron, 252 

Big Bang, 384 

Boltzmann weight, 439 
Boltzmann’s constant, k, 378 


calculus review, 82 

caloric fluid, 361 

calories, 361 

Carnot, 401 

Carnot engine, 403 

causality and relativity, 227 
center-of-mass coordinate or the CM, 120 
centripetal acceleration, 31 


circular motion, 29, 64 

CM momentum, 129 

CM of nonuniform rod, 125 

CM of planet and sun, 130 

CM of rod, 124 

CM of triangle, 126 

coefficient of kinetic friction, 57 

coefficient of linear expansion, 363 

coefficient of static friction, 57 

collision, totally elastic, 137 

collision, totally inelastic, 137 

collisions, 136 

complex-conjugate of z, 269 

complex number: amplitude and phase, 
272; imaginary part, 269; modulus of z, 
271; polar form, 272; real part, 269 

complex number z, 269 

components of vector, 19 

Compton effect, 249 

conduction, 369 

conservation of energy, 75 

conservative and non-conservative 
forces, 92 

conservative force, 95; test for, 97 

conservative forces, 81 

constant acceleration, 6 

constants, 500 

constructive interference, 328 

continuity equation, 346 

convection, 369 

Copernicus, 101 

coupled masses, 61 

crest, 314 

cross product, 183, 184 


damped oscillations, 291 
decibels, 319 

density, 335 

destructive interference, 328 
differential equation, 52
dimensionless velocity $\beta = u/c$, 233
displacement, 18 

Doppler effect, 320 

Doppler effect, transverse, 322 
dot product, 88, 89 

dot product of four-vectors, 239 


dot product of two four-vectors, 239 


double slit experiment, 326 
dynamics, 2, 3 


efficiency, 402 

energy, 70 

energy density of wave, 318 
energy-momentum, 246 
entropy, 401, 412 

equation of state, 378, 385, 433 
ether, 201 

Euler’s formula for $e^{i\theta}$, 266
event, 197 

exhaust velocity, 134 


first law of thermodynamics, 388 
fluids, 335 

force, 36 

force constant, 42 
four-momentum, 241, 243 
four-momentum of photon, 247 
four-vector, 238 

four-velocity, 242 

Fourier series for oscillator, 300 
free expansion, 421 

free-body diagrams, 46 

friction, 54, 56 

fundamental frequency, 333 


Galilean transformation, 198 

gas thermometers, 358 

general theory of relativity, 44, 104 
geometry of spacetime, 232 
geosynchronous satellites, 111 
Gibbs, 437 

gravitational mass, 44 


gravitational potential, choice of constant, 114
gyroscope, 191 


harmonic oscillations, rotational, 
280 

harmonic oscillator phase, 278 

heat, 361 

heat current, 371 

heat reservoir, 369 

heat transfer, 368 

homogeneous, linear differential 
equation, 286 

hydraulic press, 341 

hyperbolic tangent, 236 


impact parameter, 142 
inclined plane, 57 
inertial frame, 37, 195 
inertial mass, 44 
inertial observer, 37, 195 
initial coordinate, 8 
initial value data, 278 
initial velocity, 8 
instantaneous velocity, 5 
interference maximum, 
328 
interference minimum, 328 
internal energy U, 387 
invariant, 25, 232 
isobaric process, 396 
isotherm, 390 


Kelvin scale, 361 

Kepler’s laws, 101 

kinematics, 2, 3 

kinetic and static friction, 54 

kinetic energy, 71 

kinetic energy, translations plus 
rotations, 165 

kinetic energy for N bodies, 
163 

kinetic theory, 381 


ladder, in equilibrium, 180 
Large Hadron Collider, 252 
latent heat of fusion, 366 

latent heat of vaporization, 366 
law of conservation of energy, 70 
law of conservation of momentum, 
128, 129 

law of inertia, 36 

law of universal gravitation, 45 

length contraction, 220 

light-like intervals, 234 

line integral, 91 

linear homogeneous transformation, 211 

linearly dependent, 287 

long-range order, 382 

loop-the-loop, 66 

Lorentz transformation, 204, 206 


major axis, 102 

mass, measurement of, 40 
Maxwell-Boltzmann distribution, 383 
mechanical energy, 76 

mechanical equivalent of heat, 373, 376 
microstates, 424 

mixed or cross derivatives, 85 

molar specific heat, 391 

mole, 378 

moment of inertia, 150 

moment of inertia, computation, 156 
momentum, 128 

mono-atomic gas, 392 


neutrino, 70 

Newton’s laws, 36, 38, 45 
nodes, 332 
nonconservative force, 94 
normal, 56 

null vector, 18 

number of moles, 378 


orbits in gravity, 108 

oscillations: critically damped, 294; 
over-damped, 291; under-damped, 292 

oscillator: complementary solution, 300;
driven, 294


paradoxes in relativity, 222 
parallel axis theorem, 159, 160 
partial derivative, 84 

partial differential equation, 309 
particular solution, 300

partition function, 439 

Pascal, 336 

past, present, and future in relativity, 227 
phase change, 365 

physical pendulum, 282 

Planck distribution, 384 

position vector, 21 

postulate of statistical mechanics, 428 
potential energy, 76 

power, 72 

precession of gyro, 193 

precession of the perihelion, 104 
pressure, 335 

principle of superposition, 287, 323 
projectile motion, 32 

projection, 19 

proper time, 240 


quasi-static process, 386 


radians, 30, 145 

radiation, 368 

rapidity, 236 

reference intensity, 319 
relativistic velocity transformation law, 213 
relativity postulates, 203 
resolving the force, 59 
resonance, 299 

rest energy, 245 

reversible, 387 

right-hand rule, 183 
rigid-body dynamics, 143 
rigid-body dynamics in 3d, 182 
rockets, 134 

rolling without slipping, 166 
Rutherford scattering, 140 


scalar multiplication, 19 

scalars, 19 

scattering in 3d, 140 

second law of thermodynamics, 399, 418 
simple harmonic motion, 275 
simultaneity, 214 

space-like intervals, 234 
spacetime, 197

spacetime interval, 233 

specific heat, 362 

specific heat at constant pressure, $C_p$, 392
specific heat at constant volume, $C_v$, 391
stable equilibrium, 275 

standing wave, 332 

state variables, 394 

static equilibrium of rigid bodies, 175 
static friction, 57 

statistical mechanics, 423 


Taylor series, 255; $(1 + x)^n$, 261; $\cos x$ and
$\sin x$, 264; $e^x$, 263; explicit form, 261

Taylor series for eX, 265 

temperature, 354 

tension, 47, 48 

thermal conductivity, 371 

thermal equilibrium, 352 

thermometers, 355 

third law of thermodynamics, 441 

time dilatation, 216 

time-like intervals, 234 

torque, 151 

torsion constant, 280 

troughs, 314 

tunneling in quantum theory, 99 


twin paradox, 219 
two-body problem, 118 


unit vectors, 19 

universal gas constant, R, 379 
universal gravitational constant, G, 107 
unstable equilibrium, 275 


vector transformation law, 24 
vectors, 18 


watts, 73 

wave equation, 308, 310 

wave intensity, 318 

wave number, 310 

wavelength, 313 

waves, 303; energy and power, 316; 
interference, 323; longitudinal, 304; 
superposition of, 323; transverse, 304 

weight and weightlessness, 49 

work done by the force, 71 

work done in d = 2, 84 

work-energy theorem and power, 71 

work-energy theorem with friction, 78 


zeroth law of thermodynamics, 354 


